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(54) Fowlpox virus promoters 

(57) Fowlpox virus (FPV) promoter DNA for use in expressing a foreign gene inserted in a FPV vector by homologous 
recombination, which comprises the promoter of any of the foltowing FPV genes:- 

(1) The FP4b gene which encodes a protein of about 657 amino acids in a sequence beginning 

Met Glu Ser Asp Ser Asn lie Ala He GIu 
G!u Val Lys Tyr Pro Asn lie Leu Leu Glu; 

(2) The BamHI fragment ORF8 gene encoding a protein of about 116 amino acids in a sequence beginning 

Met Glu Glu Gfy Lys Pro Arg Arg Ser Ser 
Ala Val Leu Tip Met Lou lie Pro Cys Giy; 

(3) The flamHI fragment ORF5 gene encoding a protein of about 105 amino acids in a sequence beginning 

Met lie lie Arg Arg Asn Asn Lys Ala Leu 
GVSerValMetSorAspPhefleLysP , 

(4) The flamHI fragment ORF10 gene encoding a protein of about 280 amino acids in a sequence beginning 

Met Lys Phe Lys Glu Val Arg Asn Thr lie 
Lys Lys Met Asn lie Thr Asp lie Lys He; and 

(5) The^gene of which thfrcbtf ilfg st and' K jfericJise s strongly to Ff*V RNA and is at least partly located within an 
approximately 790 bp DNA sequence, containing near its 5'-end the sequence: 

(ff) TGTCATCATA TCCACCTATA AATGTAATAT and naar its 3'-end the sequence: 
AAGAATAGTC TAAATTACCT AACATAG AAC ATCAT (3? 



At least one drawing original!/ filed was informal and the print reproduced here is taken from a later (Bed formal copy. 
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FOWLPOX VIRUS PROMOTERS 
Background of the Invention. 
1. Field of the Invention. 

The Invention is In the field of recombinant DNA technology 
and relates to promoters useful for the expression of foreign DNA 

05 Inserted into a fowl pox virus vector. 
' 2. Description of the prior art. 

Poxviruses are large viruses with a complex morphology 
containing linear double-stranded DNA genomes. They are among 
the few groups of DNA viruses that replicate within the cytoplasm 

10 of the cell. They are subclasslf led into six genera: orthopox- 
viruses, . avl poxviruses, capripoxviruses, ' leportpoxvlruses , 
parapoxvi ruses and entomopoxvi ruses . Vaccinia virus, an 
orthopoxvirus, is the most widely studied of the poxviruses, and 
is the subject of U.S. Patent 4,603,1 12 (Paolettl et aj..,). 

15 Fowlpox virus is an avipoxvirus or avian poxvirus. 

Recent advances in recombinant DNA technology have allowed 
vaccinia virus to be used as a vector to carry and express 
foreign genes. For a review see M. Mackett & G.L. Smith, Journal 
of General Virology 67, 2067-2082 <1985>. Certain properties of 

20 vaccinia virus make it suitable for this purpose. Firstly, it 
tolerates large amounts of extra DNA in Its genome, at least up 
to 25.000 base pairs. Secondly, it encodes Its own RNA 
polymerase which specifically initiates transcription of 
messenger RNA, beginning at the viral promoter sequences on the 

25 DNA genome. The host ce.ll RNA polymerase II does not recognise 
these viral promoters, nor does the vaccinia RNA polymerase 
transcribe from promoters recognised by the host cell RNA 
polymerase. These two properties allow foreign genes to be 
Inserted into the vaccinia virus genome under the control of a 

30 vaccinia virus promoter. Because of the very large size of the 
vaccinia virus genome (186,000 base pairs) and the fact that the 
DNA alone Is not infectious, conventional recombinant DNA 
techniques of restriction enzyme cleavage and ligation of DNA 
fragments Into the genome are not technically feasible. 
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Therefore DNA Is Introduced Into the genome by a process of 
homologous recombination. Homologous recombination Involves 
essentially (1) pre-selecting a length of the vaccinia virus (VV) 
genome in some region which does not impair the replication and 
' 05 normal functioning of the virus (hereinafter called a 
"non-essential region"), (2) making a construct of a length of 
foreign DNA in a copy of the non-essential region so that the 
foreign DNA is flanked by extensive sequences of non-essential 
region of W DNA, (*3) co-infec-ting. appropriate tissue* culture 

10 cells with the VV and with the construct and (4) selecting cells 
containing VV in which the pre-selected length has been swapped 
over ("recomblned") in vivo so that it is replaced In the genome 
by the construct DNA. 

In order to Insert the foreign gene Into the construct, the 

15 construct should itself be contained in a vector, e.g. a 
plasmid. It should also comprise a promoter for regulating 
expression of the foreign DNA within the virus. The procedure Is 
more fully described in the Mackett and Smith review supra . 
Vaccinia virus vectors have been used In this way experimentally 

20 for the expression of DNA for several viral proteins. See, for 
example, M. Kleny et aj..; Nature 312. 163-166 (1984) on the 
expression of a rabies virus glycoprotein. Since the vaccinia 
virus vector can be attenuated. I.e. altered to make it less 
virulent,, without impairing Its use as a vector. It has 

25 considerable potential for use 1n vaccination. 

It has been recognised* f©F some years that " to ppinc^p^e- 
similar technology could be applied to fowlpox virus (FPV). see, 
for example. M.M. Binns et al.., Israel Journal of Veterinary 
Medicine 42, 124-127 (1986), thereby providing a vector for use 

30 in vaccinating poultry. FPV like W, has a genome of vast size 
(it is even larger than W: estimates range from 240 to 360 
kllobases) and It is not known to what extent It is similar to 
vaccinia virus. 

One of the e.ssentlal requirements for the expression of 
35 foreign DNA in a FPV vector is a strong promoter, which will be 
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recognised by the FPV RNA polymerase. Several promoters have 
been Identified In VV but their relative strengths have not been 
fully explored. The main ones are as follows: 

1. p7.5. The 7.5 Kd polypeptide promoter, which has early and 
05 late activities, has been widely used to express genes Inserted 

Into vaccinia, S. Venkatesan et ah, Cell 125. 805-813 (1981). 
M.A. Cochran et ah. J. Virol. 54, 30-37 (1985). 

2. pll. The gene for the 11 Kd major structural polypeptide, 
mapping at junction of vaccinia Hindlll fragments F/E, has a late 

10 promoter which has been widely used, C. Bertholet et ah Proc. 
Natl. Acad. Sci. USA 82. 2096-2100. (1985). 

3. pTK. Promotes the thymidine kinase, gene which maps in 
vaccinia HI nd lll fragment J. J. P. Weir et ah . Virology 158 
206-210 (1987). This pronoter has not been used much and is 

15 thought not to be strong. 

4. pF. Promotes an unknown, .early, non-essential gene, which 
maps in vaccinia Hind lll fragment F, see D. Panicali et ah 
Proc. Natl. Acad. Sci. USA 80. 5364-5368 (1983). It ha*, recently 
shown to be "relatively Inefficient" I.e. 10-fold lower than the 

20 TK promoter, B.E.K. Coupar et ah, J. Gen. Virol. 68. 2299-2309 
(1987). 

5. p4b. The 4b gene encodes a 62 Kd core protein. It has a late 
promoter which maps in vaccinia Hindlll fragment A, see J. Rosel 
etah. J. Virol . 56. 830-838 (1985). The 4b protein accounts 

25 for approx 10X of viral protein in vaccinia. 

6 and 7. pM. and pi. These are two uncharacterl sed early 
vaccinia promoters from vaccinia Hind lll H and I fragments 
respectively used in construction of a multivalent vaccinia 
vaccine. M.E. Perkus et ah. Science 229. 981-984 (1985). 

30 8. p28K. Promotes a .gene encoding a later 28 Kd core protein, 
3. P.* Weir et ah, J. Virol. 61_, 75-80 (1987). It hasn't been 
used much. 

Because of the lack of information about the genomic DNA 
sequence of FPV (and, indeed, VV, since only about a third of the 
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genomic DNA sequence of VV has been published), It has not been 
possible to predict whether a particular promoter known In VV has 
a counterpart 1n FPV, nor could Its efficiency as a promoter be 
predicted. 

05 Only very limited data have been published about the DNA 
sequence of the FPV genome. Thus, O.B. Boyle et aj. , Virology 
156 . 355-365 (1987), have published the sequence of the thymidine 
kinase (TK) gene and flanking sequence totalling 1061 base 
paTrs. These authors looked at the FPV TK promotYr regfon* aW 

10 noted that it contained a so-called consensus sequence common to 
eleven VV gene promoters [A. Pluciennlczak et a1_. , Nucleic Acids 
Research 1_3, 985-998 (1985)]. This "consensus sequence" is 
supposedly based on TATA — - (20 to 24 bp) — AATAA, but there 
were many divergences from it and' the whole, region Is so AT-rich 

15 that the notion of a "consensus sequence" does not bear critical 
examination. Moreover, the distances between these consensus 
sequences and the 5* ends of the TK mRNAS differed as between FPV 
ar.d VV. Since the FPV TK gene was found to be expressed In 
vaccinia virus vector, and therefore recognised by the- VV RNA 

20 polymerase, some degree of similarity between these two promoters 
is deducible. !t does not follow, of course, that' every VV 
promoter would be highly homologous with every FPV promoter and 
indeed unpublished data of the present Inventors suggests that 
this Is. not the case. 

25 Further prior art Is referred to below after the section 

"Summary of the Invention"', wft'houf which ft's context would hot- 
be apparent. 

Summary of the invention 

Much of the present invention has arisen by locating some FPV 
30 genes, testing the 5'-non-coding region associated with them for 
promotional strength and thereby selecting certain strong 
promoters. 

Several regions of the FPV genome have been Investigated in 
research leading, to the- invention. One of them arises by cutting. 
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the DNA with the enzyme BamHI, selecting from a range of plasmids 
thereby generated one with an Insert of about 11.2 Mlobases and 
examining that length of DNA. Another arose by random cloning of 
the FPV genome and comparing these sequences with that of DNA of 

05 the vaccinia 4b gene mentioned above. 

Another method of Identifying strong promoters involved 
simulating the transcription of RNA from the FPV DNA. 

As a result, five strong "promoters have been found and the 
Invention provides various DNA molecules containing them. The 

.10» science of promoters'- of poxvirus DNA Is* at present poorly 
understood. It is known that certain regions to the 5' or 
"upstream" end of a gene serve to assist In transcribing genomic 
DNA into messenger RNA by binding the RNA polymerase Involved in 
the transcription so that the mRNA which contains the start codon 

15 of the gene can be transcribed. Such upstream regions are 
referred to as the "promoter". It Is often not possible to say 
for certain which nucleotides of the upstream sequence are 
essential and which are inessential for promotion, nor Is the 
minimum or maximum length of the promoter known with great 

20 precision. Although this lack of precision in the whereabouts 
and length of the promoter might at first sight seem rather 
unsatisfactory. It Is not a problem In practice, since, there 1s 
normally no harm in including additional DNA beyond the region 
which serves to transcribe the DNA. Further as doscrlbed later, 

25 it Is possible by tedious experiment to determine this region 
more precisely. In all these circumstances. It is therefore more 
appropriate to define the ; promoter' By reference to ttte gene which 
1t precedes, rather than by reference to the sequence of ,the 
promoter. Four of the genes In question are those of 

30 open-reading frames 0RF8, 0RF5 and 0RF10 of the BamHI fragment 
and the gene of FPV which most nearly corresponds to (is of 
highest homology with) the vaccinia 4b gene. The last-mentioned 
FPV gene Is conveniently designated FP4b. These genes are fully 
Identified hereinafter in Example 1. 



- 6 - 



The fifth strongly promoted gene was Identified by research 
into amounts of mRNA likely to be produced when viral DNA is 
transcribed. The theory is that strong promoters direct the 
transcription of greater amounts of RNA than weak promoters. In 

05 order to avoid the problems of experimentation jn vivo . RNA was 
prepared in vitro in a manner thought likely to emulate .in vivo 
transcription. The RNA thus prepared was hybridised to BamHI and 
EcoRI restriction fragments of FPV DNA. Strong hybridisation to 
sev.exal fragments, wa,s taken to. indicate, a, s.tr.ongJ.y promoted ge.ne 

10 and by this means such a gene which falls at least partly 'within 
a 0.79kb EcoRI fragment was Identified and the fragment partly 
sequenced: see Example 2. 

These five genes can be defined in various ways, always 
remembering, of course, that there will doubtless be minor 

15 differences in their sequence between one strain or type of FPV 
and another. One convenient, arbitrary, way of defining them is 
by reference to an appropriate length of the amino acid sequence 
which they encode. It may reasonably be assumed that the first 
10 or, more preferably, the first 20 amino acids, say. would form 

20 a unique sequence in FPV. Accordingly, one convenient definition 
of four of the genes Is based on the first 20 amino acids as 
follows :- 

(1) The FP4b gene which encodes a protein of about 657 amino 
acids in a sequence beginning 

25 Met Glu Ser Asp Ser Asn He Ala He Clu 

G.lu. Val Lys Tyr Pro Asn Lie Leu- Leu* G.l-a 

(2) The BamHI fragment 0RF8 gene encoding a protein of about 
116 amino acids In a sequence beginning 

Met Glu Glu Gly Lys Pro Arg Arg Ser Ser 
30 Ala Val Leu Trp Met Leu He Pro Cys Gly 

(3) The BamHI fragment OS. r 5 gene encoding a protein of about 
105 amino acids In a sequence beginning 

Met He lie Arg Arg Asn Asn Lys Ala Leu 
Gly Ser Val Met Ser Asp Phe He Lys Thr 
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(4) The BamHI fragment 0RF10 gene encoding a protein of 
about 280 amino acids in a sequence beginning 

Met Lys Phe Lys Glu Val Arg Asn Thr He 
Lys Lys Met Asn lie Thr Asp He Lys He 
05 Gene (5) could be defined as at least partly located within a 
790bp (0.79M)) DNA sequence, containing near Its 5'-end the 
sequence: 

(5') TGTCATCATA TCCACCTATA AATGTAATAT and near its 3'-end 
the sequence: 

10 AAGAATAGTC TAAATTACCT AACATAGAAC ATCAT (3') 

In relation to genes (1) to (4>, it will be appreciated, of 
course, that variations 1n the 20 amino adds are likely to occur 
between different FPV strains. Probably there would be at least 
90X homology over the whole gene, but there may well be less 

15 homology over the first 20 amino acids, perhaps up to 3 or 4 
differences. It is confidently believed however that no one 
skilled in the field will be in any doubt as to which gene is 
intended, whatever the precise degree of aberration in the amino 
acid sequence of the first 10 or 20. 

20 Likewise, In relation to gene (5> It will be appreciated that 

the quoted DNA sequence Is that of the EcoRI fragment detected, 
that in some strains of FPV one or both of the EcoRI restriction 
sites might be lacking and that consequently it is more 
definitive to quote the DNA sequence. Kith the aid of the 

25 sequence information given herein for the 790 bp fragment it will 
readily, b.e possible to compute the. sequencing of the 790 bp DNA. 
and then find an open-reading frame (ORF) for gene (5). This 
gene does not necessarily fall wholly within the 790 bp 
fragment. Thus, it might be necessary to sequence the genome to 

30 either side of the 790 bp region. This could* be done by 
labelling the 790 bp DNA and using It to probe a library of the 
FPV genomic DNA made by restriction with a different enzyme. 
When the beginning of the ORF is located, the 5'-non-coding 
sequence can be used as promoter DNA. If there should perchance 



be two genes fa-Ming within this fragment, whichever hybridises 
more strongly to the RNA Is Intended. Although there might be 
sane nucleotide- variation between strains of FPV, there would 
probably be at least 801 homology at the ONA level. It Is 

05 confidently believed, however, that no one skilled in the art 
will be In any doubt as to which gene Is Intended. 

It is expected that before long it will be possible to create 
a partial map of the FPV genome. FPV, like other poxviruses, has 
a linear genome with similarities between its ends: The terminal 

10 sequences are invertedly repeated. Within these terminal 
inverted repeats (TIRs) there are tandemly repeated sequences. 
The BamHI digest gave rise to clones containing these terminal 
Inverted repeat (TIR> sequences and it has been determined that a 
length of about 3.7 to 4.0 kb at one end of the approximately 

15. 11.2 kb fragment (the left-hand of the sequence thereof shown 
hereinafter) lies within a TIR in the strain of FPV 
Investigated. The F*Mb gene Is believed to lie in a central 
region of the genome. The whereabouts of the 0.79 kb sequence Is 
unknown at present. 

20 The Invention includes a DNA molecule which consists 

substantially of the non-coding DNA to the 5'-end of each of the 
above-Identified genes and comprising the promoter thereof. 
"Non-coding" means not coding for that gene : it could code for 
another gene as well as serving as a promoter. Any reasonable 

25 length of such DNA, typically up to 150, usually up to 100, and 
especially up to 80 nucleotides (or base-pairs In the case of ds 
DNA) of the 5'-end (even if it codes for DNA within the next gene 
along the genome), is herein referred to as "promoter DNA". 

The invention also includes a recombination vector comprising 

30 a cloning vector containing a non-essential region (NER) sequence 
of FPV. said NER being interrupted by DNA which consists of or 
Includes <a> promoter DNA of the Invention, followed by (b) a 
foreign gene (I.e. a gene which it is desired to Insert into the 
FPV vector) transcribable by the promoter. 
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In one particular aspect, the Invention includes a 
recombination vector which comprises In order : 

(1) a first homologously recombinable sequence of the 
fowlpox virus (FPV) genome, 
05 (2) a sequence within a first portion of a non-essential 

region (NER) of the FPV genome, 
(3) FPV promoter DNA according to the Invention, 
<4) a foreign gene transcribably downstream of the promoter 
(whereby when the fowlpox virus RNA polymerase bind.s to the 
10 promoter it will transcribe the foreign gene Into mRNA) and 

(5) a sequence within a second portion of the same NER of 
the FPV genome, the first and second sequences preferably 
being 1n the same relative orientation as are the first and 
second portions of the NER within the FPV genome, and 
15 (6> a second homologously recombinable sequence of the FPV 

genome, said sequences (1) and (6) flanking the NER 1n the 
FPV genome and being in the same relative orientation in the 
recombination vector as they are within the FPV genome. 
In another aspect, the Invention Includes a DNA construct 
0 which comprises a promoter of the Invention transcribably linked 
to a foreign gene. Such a construct or "cassette" can be 
inserted in a cloning vector, which can then be used as a 
recombinant vector useful in preparing a recombination vector of 
the invention. 

25 The invention further Includes hosts harbouring the 

re comb 1 nation and recombinant vectors of the. Invention* 
especially a bacterial host harbouring a plasmld vector. 

The Invention Is further directed to a recombinant FPV which 
Is the product of homologous recombination of . FPV with a 

30 recombination vector of the invention containing a foreign gene; 
the process of homologous recombination; animal cells Infected 
with such a recombinant FPV; a process of In vitro culture of 
these Infected cells; and a method of vaccinating' a responsive 
animal, especially a chicken, which comprises inoculating it with 

35 the recombination vector of the Invention. 
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Further description of the prior art 

At the International Poxvirus Workshop meeting held at Cold 
Spring Harbor, New York, on 24-28 September 1986, F.M. Tomley 
gave a talk, with slides, entitled "Molecular structure and 

05 organisation of an 11.3 kb fragment of fowlpoxvirus". This talk 
presented an outline of- the preliminary results of sequencing the 
11.2 kb BamHI fragment (at that time thought to be 11.3, rather, 
than 11.2 kb long). The talk dealt with the AT richness of the 
fragment.. In eluded a. sJIde siiowlng, 20 open reading frames, 

10 discussed codon usage in FPV, compared the FPV 48 kd predicted 
polypeptide (herein "ORF 1") with a 42 kd early protein In VV and 
compared other predicted polypeptides with hepatic lectins and 
anti-alpha-trypslnogen. No mention was made of the functionality 
of the ORFs or of the strength of gene expression, nor was any 

15 length of DNA sequence shown. The same talk was given at the 
Herpes/Poxvirus Workshop of the Society for General Microbiology, 
held at St. Andrews, Scotland, April 1987. 

At the corresponding meeting in September 1987, J.I. A. 
Campbell et ak, displayed a poster relating the terminal BamHI 

20 fragment of F?V. lying between the 11.2 kb SsmKI fragment and the 
end of the genome. No DNA sequence was shown. 

During the priority year, F.M. Tomley et aj_. , J. Gen. 
Virology 69, 1025-1040 (1988), have given the full sequence of 
the BamHI fragment, together with some detail of relationships of 

25 predicted polypeptides to other proteins. A study of the 
functional promoter activity of the sequences upstream of the' 1-2 
major ORFs Is referred to as unpublished data. The first 
disclosure of this data was in a poster exhibited by M.E.G. 
Boursnell et al... at the Vllth International Poxvirus/Irldovirus 

30 Meeting, Heidelberg, 22-26 August 1988. 

Brief description of the accompanying drawings 

Fig. 1 shows the general scheme of a procedure of homologous 
recombination as applied to fowl pox virus; 

Figure 2 and 3 are plasmld maps showing schematically the 

35 derivation of recombination vectors of the invention useful in 
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the homologous recombination; and 

Fig. 4 Is a plastnld map showing the derivation of a construct 
for testing FPV promoters of the Invention In a transient assay. 
Description of the preferred embodiments 

05 While the precise length of DNA required for promotion is not 

known. It 1s generally reckoned to be up to 100 base pairs from 
the RNA start site, but this can be as much as 50 base pairs away 
from the gene start site (the ATG codon). Accordingly ' a DNA 
s-equenc-e con to-toed within 1-50 ba-se pa»1rs% less* preferably 100* or 

10 even 80 bp, to the 5'-end of the gene (immediately preceding the 
start codon) is of particular Interest for the purposes of the 
.Invention. The DNA sequences of these 150 base pairs are shown 
below (arbltarily divided into blocks of 10 for ease of reading) 
for genes (1 ) to (4) . 

FP4b (5') TATTACGTGG ATAAATATAT ATCTTCAGGA AAAGGGTATT ATGTTACCAG 
ATGATATAAG AGAACTCAGA GATGCTATTA TTCCTTAACT AGTTACGTCT 
CTTTAGGTAC TTATTTTGAT ACGTTACAAG TAAAAAACTA TCAAATATAA 

<3') 

0RF8 (5') AGAATAGCAT TGCAAAGTTC TACACGATCC ATTGTATAAT ATAGGTGTTC 
AACACCTCTC GATA7ATCAT TATTTGTT7T TTCAATTTTA TTATAAGTAG 
TTTGAATGCA TTTTTAAGTT TAATAAATCT TGATAAAGTA TATTTAAAAA 

(3*) 

0RF5 (5') TAAACCAAAT ATACTAAAAT ATAAAATTAT GCCGCGGGAT GATAAGATAC 
TTCAGATGAT CGTGATGAAC TATATTTATT AATTGGCAAT ACTTAAAAAT 
AATGTTTATA ACATATGTAA ATATAATAAA CAATAATTTA GATTTTTAAA 

( T > 

0RF10 (5')ACTAGATTGT ACAAATATTA ATATGTGTAA TTTCTTATAT AGTAATATAG 
TAGGATGTGA TATATGCACC ATAGAAAAAT TTTATATTTG TATAAAACCG 
ATAAATAAAA TAAACTTAT7 TAGTTACTTT GTAGAGTATA CTAAATAATA 

(3') 

15 In the above sequences an ATG start codon follows on at the 
right-hand or 3'-end. 

Just how much of the 5'-nonrCOd1ng sequence Is necessary for 
efficient promotion Is not known precisely. However, experiments 
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can be carried out to answer this question, and in fact some have 
been performed for W. Consequently, similar experimentation 
would be possible to determine the sequences necessary for FPV. 
One such technique is deletion mapping: by the simple expedient 

05 .of removing parts of the sequence under test, and. assaying its 
subsequent promotion efficiency, the sequences sufficient for 
promoter activity can.be identified. Thus, in vaccinia it has 
been found that 100 base pairs (bp) of sequence upstream of the 
11 Mlodalton (UKd) gene are. sufficient to act a* a promoter' and 

10 temporally regulate late transcription C. Bertholet et aj.. , Proc. 
Natl. Acad. Sd. (USA) 82 2096-2100 (1985). Deletions leaving 
about 15 bp on the 5'-side of the putative site at which mRNA 
transcription starts still yielded high levels of expression, 
C. Bertholet et aj.., EHBO Journal 5, 1951-1957 (1986V However, 

15 H Hanggl et aj.., EMBO Journal 5 1071-1076 ' (1986) found that the 
same fragment functioned at a lower level when It was 
translocated to a new position. At this new position, deletions 
leaving 32 bp on the 5'-side of the ATG start codon had no effect 
on promoter strength. H.A. Cochran et ah, Proc. Natl. Acad. Sci 

20 (USA) 82. 19-23 (1985) showed that the activity of the 7.5Kd VV 
promoter resided in an approximately 30 bp segment. J. P. Weir 
and B. Moss, Virology 158, 206-210 (1987) found that 32 bp 
upstream o.f the RNA start site were sufficient for correctly 
regulated promotion of the thymidine kinase (TK) gene in VV. 

25 A 228 bp sequence of DNA from in front of a 28Kd late gene 

(*from positions. -218 to> +10 relative to fche> RNA> star-t *lte) was 
placed in front of the chloramphenicol acetyl transferase (CAT) 
gene and found to act as a promoter, J. P. Heir and B. Moss, 
J. Virology 61. 75-80 (1987). A series of 5* deletions extending 

30 towards the RNA start site were made. A gradual reduction In CAT 
expression occurred as the deletions extended from -61 to -18. 
Mutants that retained 18 bp before and 10 bp after the RNA start 
site still expressed the CAT gene as a late gene, though at a 
submaxlmal level. 
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While deletion mapping can define those sequences sufficient 
for promotion activity. 1t cannot pinpoint the exact bases 
necessary for activity within the defined sequences. Various 
workers have altered bases within putative promoter sequences. 

05 either by syntheslslng specific oligonucleotides, M. Hanggi 
et aj... loc. clt.. or by site-directed mutagenesis. J. P. Weir and 
8. Koss. J. Virology 61. 75-80.(1987). In both cases alterations 
In very few. even single, bases had profound effects on the 
e"f*f*k*Vency' of- promotion, and hence- favdl'V-idua-l* bases of importance 

10 could be Identified. Since, however, some changes In sequence 
are permissible without loss of the promotional effect. It will 
be appreciated that it 1s necessary that the invention should 
cover sequences which are variant, by substitution as well as by 
deletion or addition from the non-coding sequences of length up 

15 to 150 bp referred to above. 

The recombination vector could contain additional sequence to 
that herein referred to as promoter ONA. Additional sequence 
could comprise (a) additional sequence more than 150 bp 5'-ward 
from the'ATG initiation codon, (b) sequence inserted into the 150 

20 bp without destroying promoter activity or. (c> part of the 
sequence of the FPV gene (Inclusive of the ATG Initiation codon 
and onwards), e.g. up to 100 bp thereof. 

The above experiments require testing for the efficiency of 
the promoter. It Is not necessary for this purpose to introduce 

25 a promoter-gene construct into FPV and monitor expression of the 
gene product. A k shorter method v . known as transient assay. 1*s 
known for use with W, H.A. Cochran et aj.. , Proc. Natl. Acad. 
Scl. (USA) 82, 19-23 (1985). In transient assay, the promoter 1s 
linked to a gene with an easily assayable product.. A plasmid 

30 containing this construct is then introduced into a cell which 
has been Infected with the virus. The viral RNA polymerase can 
transcribe off the promoter, even though the promoter has not 
been Incorporated in the viral genome. Because expression only 
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lasts while both the virus and t'he plasmld DNA are present In the 
cell together, this form of expression is known as 'transient'. 
Two different marker genes have been used in vaccinia virus 
transient assay systems, the chloramphenicol acetyl transferase 

05 (CAT) gene. M.A. Cochran et aj., supra and the beta-galactosidase 
"JacZ" gene, D. Panicali et aj... Gene 47 193-199 (1986). Using 
the CAT gene the promoter sequences under test were cloned in 
front of a complete CAT gene which included its own ATG start 
codonv Thusv thi-s t* a» "feran-scriptlona^ fusion" sequence.. I.e.. 

10 the sequences are fused in a non-coding region. In the case of 
the beta-galactosidase lacZ gene both a transcriptional and a 
translational fusion vector were described, both for transient 
assay and for testing in recombinants.. The translational fusion 
vector contained a beta-galactosidase gene lacking Its own start 

15 codon, so that the fusion occurs within a coding region. The ATG 
start codon was provided by the VV promoter under test. The 
beta-galactosidase "lacZ" gene was therefore cloned so as to be 
in frame with the VV gene start codon, the VV gene being fused to 
the ]acZ gene before codon 9 of the latter. Thus, the promoter 

20 was in exactly the same context relative to the Initiation codon 
used 1n the fusion vector as In its native position. 

In the present invention, the lacZ gene has been used only 
for the transient assay to determine promoter strength. It will 
be appreciated, however, that in the practice of the invention a 

25 foreign gene relevant to improving the condition of poultry would 
be 1-nseHed- Into- the fowl pox* v-Ipusv Preferably- the- gene- will- be- 
one appropriate to an in vivo sub-unit vaccine, for example one 
or more genes selected from Infectious Bronchitis Virus (IBV), 
Infectious Bursal Disease virus, Newcastle Disease Virus <NDV>, 

30 Marek's disease virus. Infectious laryngotracheal s virus and 
genes encoding antigenic proteins of Elmerla species. Particular 
genes of Interest are the spike genes of IBV and the HN and F 
genes of NOV as described in PCT Patent Application Publication 
No. Hp 86/05806 and European Patent Application Publication No. 

35 227414A (both National Research Development Corporation). Ii 
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order for the foreign gene to be correctly translated in vivo It 
Is necessary for the foreign gene to have its own ATG start codon 
• Inserted 1n the region just following the promoter. 

It Is necessary to locate a non-essential region of the FPV, 

05 1n which to Insert the promoter of the invention and the desired 
foreign gene. In principle, they could be inserted anywhere In 
the FPV genome which would not harm the basic functions of the 
virus, or interfere with the' action of the FPV promoter or the 
foreign gene. It can be a coding or non-coding region. In VV, 

10 the thymidine kinase (TK) gene has often been used for this 
purpose. See. for instance. Example 4 of WO 86/05806 mentioned 
above, which describes the expression of the IBV spike gene in VV 
using the 7.5K vaccinia promoter and the TK non-essential region. 
It will be appreciated that the detection of the Insertion of 

15 the foreign gene would depend on detection of virally infected 
cells which do not produce any of the non-essential gene, e.g. 
TK. Such cells are described as "TK minus". Alternatively, one 
could use the TK gene or a markerless coding or non-coding region 
and detect the Insertion of the foreign gene by a hybridisation 

20 assay 1n which a labelled nucleotide sequence complementary to 
the foreign gene sequence Is employed. 

PCT Application HO 88/02022 published 24th March 1988 (CSIRO) 
describes a method of stably Inserting a foreign gene within the 
TK gene of FPV, with the aid of a dominant selectable marker gene 

25 ("Ecogpt") and a VV promoter. The disclosure of this patent 
application can be used in the present invention, with 
substitution of an FPV promoter of the invention for the VV 
promoter. Use of the FPV promoter is favoured as likely to be 
more acceptable to the veterinary medicine licensing authorities. 

30 The promoter of the Invention and foreign gene then have to 

be - inserted into the non-essential region (NER) of the FPV 
genome. The procedure of homologous recombination illustrated by 
Figure 1 of the drawings, provides a way of doing so. A fragment 
of genomic DNA containing tte NER Is sub-cloned in a cloning 

35 vector. If desired, it can be shortened to remove most of the 
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sequence flanking It. A construct 1s then made. In the cloning 
vector, comprising part of the NER (starting at one end thereof), 
followed by the FPV promoter ("P") of the invention, followed by 
the foreign gene, followed by substantially the remainder of the 

05 NER (terminating at the other "end thereof). This construct. In 
an appropriate vector, forms the recombination vector which 1s 
used to transfect the cells Infected with the FPV, e.g. by the 
calcium phosphate method, whereby recombination occurs between 
the^ NER' sequences frn* the* vector- and* the NE-R' sequences Vn the* 

10 FPV. The FPV then automatically re-packages this altered genome 
and the thus altered FPV (recombinant FPV) 1s part of this 
invention. 

Figures 2 and 3 of the drawings Illustrate alternative 
methods of making the above recombination vector. Referring 

15 first to Figure 2, a non-essential region possessing two 
restriction sites A, B is Inserted in an appropriate vector, 
which, by way of illustration only, will be described as a 
plasmid. In another plasmid having the same (or ligatably 
compatible) restriction sites A, B, a construct Is made of FPV 

20 promoter sequence of the Invention followed by the foreign gene 
sequence. It is of course essential that this construct 1s made 
so that the mRNA transcription will begin at or before the start 
codon of the foreign gene. Since It 1s time-consuming to 
determine precisely where the mRNA transcription start is 

25 effected by any particular promoter, It 1s convenient simply to 
Insert, say, V00* or more" pref erably 150 Base 1 pairs of promoter 
ONA Immediately preceding the FPV gene which It normally 
promotes, to ensure good working of the promoter. However, It 
will be appreciated that, given the time to do experiments 

30 previously indicated, portions of promoter ONA could be "chewed 
off" by restriction enzyme treatment to shorten It, thereby 
eliminating any unnecessary sequences. Such adaptation Is 
considered to be an Immaterial variation of the particular 
embodiments of the invention, described herein. Equally, It would 
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be possible to extend the promoter sequence at the downstream end 
thereof, e.g. to Include a few base pairs of Its natural FPV gene 
sequence. This would normally result In expression in vivo of a 
translatlonal fusion protein If . the foreign gene sequence Is 

05 arranged to be In frame with the natural FPV gene. However, such 
a protein Is not particularly desired and In fact any short 
sequence of nucleotides could be positioned between the promoter 
DNA and the start codon of the foreign gene. 

The restriction sites A, B are located in the plasmld DNA 

10 flanking the FPV promoter DNA and the foreign gene. Of course, A 
could be within the promoter DNA If it falls within a 
non-functional portion thereof. While two different restriction 
sites have been shown for simplicity they could of course be the 
same. They can be sticky- or Vlunt-ended sites and can be 

15 prepared artificially by filling In and/or VI gating additional 
nucleotides, In ways well known In the recombinant DNA field. 
Conveniently A and B in the type 2 construct are converted into 
identical blunt-ended sites (C, not shown) and then allowed to 
recombine at a single blunt-ended site C (replacing A, B> within 

20 the NER. Care will have to be taken, of course, to select sites 
which are unique In the vector DNA to prevent recombination of 
other sequences of DNA from occurring. 

DNA from the two plasmlds are 1 iga ted together j_n vitro and 
then transformed into the host, with suitable restriction 

25 enzymes, to produce the final construct of type 1. The 
promote r-fo/elgn gene construct of type 2 1s, of course, made 1,n 
a similar way from a vector containing the promoter and another 
containing the foreign gene. 

Figure 3 Illustrates another method of preparing recombinant 

30 vectors of the Invention. In this method one first prepares a 
construct comprising a first part of the NER followed by the FPV 
promoter of the Invention, followed by a short sequence of 
nucleotides containing at least one cloning site for Introduction 
of a foreign gene, followed by a second part of the NER, which 
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could be simply substantially the remainder of the NER. Cf 
course, virtually any length of DNA would provide a cloning site 
suitable In some way or other for Introducing a foreign gene. 
Preferably these constructs contain a multiple cloning site, that 

05 Is to say a short length of DNA containing the sites of a variety 
of different restriction enzymes,. for example at least ten. Such 
a construct then has versatility, since It will then be much 
easier to restrict DNA flanking almost any foreign gene at sites 
close to each end thereof and Insert the foreign gene into the 

TO* m'ulfi'pTe* cloning sTfe lll'dstrate'd* i'n Figure* 3. Only two sites X. 
Y have been shown, for simplicity and, again, these can be filled 
In and extended or chewed back, as desired, to give Identical 
blunt-ended sites (Z, replacing X, Y). In the final constructs, 
the promoter DNA will be separated from the foreign gene by a 

15 portion of the multiple cloning site, but this will not adversely 
affect the transcription of Ue mRNA In the final virus. 

In either method of construction", the NER is split by the 
promoter and foreign gene. It Is, of course, not essential that 
it be split 1n a central region.. Nor Is 1t essential that the 

20 second portion of the NER constitute the entire balance or 
remainder of the NER. So long as each end of the NER contains or 
is flanked by a long enough stretch of DNA for' homologous 
recombination, it does not matter that a part of the NER might be 
excised somewhere in between or that additional (irrelevant) DNA 

25 be Inserted in preparing the recombination vector. Obviously, It 
1s not necessary that the NER used be the complete region or gene 
Identified in the FPV genome as rion-essentlal. Any part o*f If 
will do, and the term "end" In relation to the NER then means the 
end of the selected part. 

30 References herein to vectors other than FPV (or. VV) mean any 
convenient prokaryotlc or eukaryotlc cloning vector appropriate 
for bulk production of the construct within a suitable host. 
Prokaryotlc vectors will ordinarily be plasmlds or phages. 
Suitable prokaryotlc hosts include bacteria. Eukaryotlc vectors 



- 19 - 



such as those of fungi, yeasts and animal cells, e.g. SV40, can 
be employed if thought more convenient. 

Although the recombination vector used will ordinarily be of 
double-stranded DNA, it Is possible to use single-stranded DNA 

05 for the homologous recombination. 

The recombination plasmid of the invention containing the 
NER, promoter and foreign gene then has to be "swapped over" for 
FPV DNA in a homologous recombination procedure. For this 
purpose, appropriate poultry cells are Infected with FPV.' It is 

10 best not to use wild type FPV for obvious reasons. FPV can 
readily be attenuated_(mutated to make it less virulent), by any 
conventional method of attenuation. 

Many different methods are available for selecting the 
recombinant viruses, and have been described for W 1n the review 

15 article of H. Mackett and G.L. Smith supra . Such methods are 
applicable in the present Invention. Using the TK ge _ e as the 
NER, one method Is to transfer the mixture of viruses containing 
the desired (recombinant) virus to fresh TK minus cells in a 
growth medium containing BUdR. . BUdR kills the original virus 

20 which was TK positive, so the TK minus mutants produced according 
to the invention can be selected. Another method 1s to enlarge 
the recombination plasmid to Include a FPV or, less desirably, a 
W promoter together with an additional marker gene, preferably 
selectable such as Ecogpt, but possibly non-selectable such as 

25 beta-galactosidase, within the NER and then detect recombinants 
by using a property of the marker gene, e.g. for 
beta-galactosidase the blue plaques generated when the 
5-bromo-4-chloro-3-indolyl-p.-galactopyranoslde (X-gal) substrate 
1s present in the growth medium. 

30 The selected TK minus cells containing the FPV (which has a 
deleted TK gene but possesses the foreign gene) are then grown In 
chicken embryo fibroblasts (CEFs), chicken fibroblasts, chick 
embryo eplthel ial " cells derived by conventional tissue culture 
methods. principally trypsinlsation of tissues or the 
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chorioallantoic membrane (CAM) of embryonated chicken or turkey 
eggs. For administration to birds, the recombinant virus can be 
given to b'.rds by aerosol, drinking water, oral, Intramuscular 
Injection or Inoculation into the wing web. Ingredients such as 

05 skimmed milk or glycerol can be used to stabilise the virus. 

While the invention Is Intended primarily for the treatment 
of chickens It is potentially of Interest 1n relation to other 
animals which might safely be Infected with FPV. It is even 
possible that it. might be considered safe to, infect humans with 

10 FPV after appropriate trials have taken place. 

The following Examples illustrate the invention. 

EXAMPLE 1 

MATERIALS AND METHODS . 
- 1. Virus strain . 

15 The HP438 strain of fowlpox virus was obtained from 

Professors A. Mayr and H. Mahnel. Ludwig-Maximl 11 ians University, 
Munich. The HP438 strain has been' obtained from the pathogenic 
Krl strain by 438 passages 1n chick embryo fibroblasts <CEFs) in 
tissue culture A. Mayr et ah, Zentralblatt fur Veterlna medizln 

20 B13, 1-13 (1966). The HP441 strain used to obtain DNA for 
cloning was derived by 3 further passages In CEF cells. 

2. Tissue culture medium . 

CEF cells were grown In 19? (Hellcome) medium, supplemented 
w.Hh Pen 1c MM n (200U/ml, Streptomycin (>2Q0yg/ml v Fungizone 
25 <2pg/m1) and 10% newborn calf serum (CS). 

3. Purification of virus and extraction of DNA therefrom . 

KP441 fowlpox virus was Inoculated on to confluent monolayers 
of CEF cells at a multiplicity of. Infection of approximately 1 
• plaque forming unit (pfu) per cell. Cells were pre-washed in 
30 serum-free medium, and the virus Inoculum was added to the cells 
In 1ml of serum-free medium per 75cm 2 bottle. After 10 minutes 
Incubation at 37*C to allow the virus to adsorb to the cells, 
10ml of medium containing 2X calf serum (CS) was added. After 5 
days, a marked cytopathlc effect (CPE) was observed, at which 
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time the supernatant was collected; Cellular debris was removed 
from the supernatant by centrtfuging at 2500 rpm for 10 minutes 
In a Sorvall GSA rotor. The virus was then pelleted from the 
supernatant by centrlfugatlon at 14000 rpm for 30 minutes 1n an 

05 Sorvall SS34 rotor. The viral pellet was resuspended In lOmH 
Trls pH9.0 and a further low speed spin performed to remove any 
remaining cellular material. 

To extract the DNA from the virus, an equal volume of. lysis 
buffer (lOOmM TRIS-HC1 pH 7.8. 2mM EDTA. 54X sucrose, 2X SDS, 

10 200mM 2-mercaptoethanol) was added to the virus suspension. 
Proteinase K was then added as a solid to 500pg/ml. This was 
Incubated at 50*C for 2 hours and then overnight at 4°C. The 
solution was then extracted slowly and gently for several hours 
with phenol/- chloroform/ isoamyl alcohol (50:48:2 v/v/v, 

15 saturated with lOmM TRIS-HC1 pH 7.5, ImM EDTA) and. then with 
ether. 2.5 volumes of absolute ethanol were added to precipitate 
the viral DNA. Viral DNA was resuspended in lOmM TRIS-HC1 pH7.5, 
ImM EDTA CTE) or in delonised water. 
4. Cloning of viral DNA Into olasmld vectors . 

20 lyg of FPV DNA was cut with the restriction enzyme BamHi 

(BRL) and Ugated Into BamHI-cut. phosphatase-treated pUCl 3 
plasmld (Pharmacia). Following transformation Into E. col 1 
strain TGI using standard methods, D. Hanahan, J. Mol . Biol. 166, 
557-580 (1983), colonies containing plasmlds with Inserted DNA 

2A fragments were Identified by a white colour on X-gal Indicator 
plates. Colonies were probed with nick-translated 
(radio-labelled) FPV DNA and plasmlds containing FPV DNA inserts 
were analysed by restriction digests of plasmld DNA Isolated by 
the method of D.S. Holmes et aj... Anal. Blochem. 1J4, 193-197 

30 (1981) and also of DNA purified on CsCl gradients. A range of 
recombinant plasmlds containing FPV DNA inserts was obtained, and 
one of these, called pMH23, of approximately 11.2 kllobases, was 
selected for sequencing. EcoRI clones of FPV DNA were made in 
the same way, except that colonies were not probed with 
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radiolabeled viral DNA but were stored In glycerol cultures as a 
•library', 

5. Sequencing of pHH23 . 

To sequence the viral Insert of pMH23, random subclones of 
05 pMH23 were generated by cloning sonicated fragments of pHH23 Into 
Smal-cut. phosphatase-treated M13mpl0 (Amersham International 
PLC). Clones containing viral Inserts were Identified by colony 
hybridisation with radlol.abe lied inse.rt from pMH23. Didepxy 
sequencing with C 35 S]dATP was used to determine the complete 
10 sequence of the viral Insert. 

6. Random sequencing of the fowlpox virus genome . 

Recombinant plasmids containing fowlpox DNA inserts were 
obtained by a similar method to the above, but starting from 
virus passaged a further three times (HP444). Random sequencing 
15 of the viral genome was carried out as 1n section 5 above. 
Sonicated fragments of viral DNA were cloned into M13mp10 and 
sequenced directly without any identification step. • 

7. Identification of putative promoter sequences . 

Sequences to be tested as promoters were Identified in two 
20 ways: 

a) Sequences upstream (Immediately 5* of) open reading 
frames in the pMH23 sequence were likely to act as promoters in 
the virus and as such were candidates for testing In a transient 
assay system. 

25, b.) Seqye.n.ce.s, upstream, of a. gene highly homologous, to. the, 4b 
gene of vaccinia virus were selected by comparing the amino adds 
encoded by the FPV DNA with those encoded by VV 4b. 

The open reading frames (ORFs) in pMH23, and the FP4b gene, 
were Identified as follows. 

30 (a) Open reading frames. 

The complete sequence of the pMH23 insert (the "11.2 kb BamHI 
fragment")" has been determined and is 11,225 nucleotides In 
length. This sequence is shown below (X - a nucleotide found to 
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differ when sequencing from different Ml 3 clones of FPV; asterisk 
« stop codon). Computer analysis of the sequence revealed the 
presence of several ORFs. If only ORFs of greater than 150 bases 
in length are considered there are nineteen complete potential 

05 genes, predicting polypeptides of between 58 and 418 amino 
acids. The ORFs numbered 1-12 were considered the major ORFs. 
either because of their size or because of their codon usage. 
The start and stop positions of these ORFs are shown 1n Table 1 
bel'ow. Seven other ORFs were considered minor, eTfher because 

10 they overlap or are- contained within other potential genes, or 
because of their codon ^usage. 
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1 GGATCCGACGCGGCTGCCAAGACCTTTATACCCGACTCTTGTTCTACTGGACGAACGCGG 
61 AGATTTAAAGCCATGGCTGACGTATAGTCGAGGACGCCCTCGGTAATAAATTGATTATAT 
121 TTTCAGTTTTAAAAAATTAATTTATATGTACTCAATATCCTTATATAGAATTATTTTATC 
181 TCTTCTGATATACGTTAGGTAGATGCCGTTCAAATAATAAAATATCTGATGACGTTTTTA 
241 TGCGCGTGTTACGTTATTATAATAGATAATAGAAATAAACGTTAAAATAATAATTAATTA 
301 TCTTTTCAGTTGTTAAATATATTCTAGTTTTATAAGCGTTATTCATATATAAAAAATATA 
361 AAAACTAAATCGTATTTATTATGATGCTACGGCGGTCATTTAACAAATTTACGCGATGGA 
421 GTTCGGTTGTACGGGAACTAATAACCAGTTGGCCGTTCACAGATTTACAGAAACGCGTTT 
481 TACATCTTTCAAAAAAGAACTTTTAGTTAATTTAGGAATAAGTGACTTAAATGATATAAA 
541 AAACATATGCGAGGATTCTAAAATATTCTTTCCGGAAAAGAGAACGGAGCTCTTAAGTAT 
601 TAAAGATCGTAAATCTAAACAAATAGTTTTCGAAAACTCCCTAAACGATGACTTGCTTAA 
661 AAAATTACACGCCTTGATCTATGATGAATTAAGTACGGTAGTAGATTCCGTTACCGTAGA 
721 GAATACCGTTACATTGATTATGTATGAAAAAGGAGATTACTTTGCCAGGCATAGAGATTT 
781 TAGTACCGTCTTTTCTAAAAACATAATATGCGTTCACCTGCTTCTATATTTGGAACAACC 
841 AGAAACGGGAGGTGAAACGGTTATATATATCGATAATAATACGTCAGTGAAATTAAAAAC 
901 AGATCATCTATTTGATAAAACTATAGAACATGAAAGTATTACCGTTGAAAGCGGTAGAAA 
961 ATGCGTGGCGTTATTCGATGTCTTACTAGAAAAAAAGTTATCCGCGTCAACAAACGTAAT 
1021 AGGTAGCATAGAATACTTAGGTAAAAAAATAAATTTATATGACAGAGAAAATGATCTTCA 
1 0,8 1 GTJ.GJGJTATT.GXGATAIGGJAATAGAAAGAATGACAGAAGATAAAGAATATAGCC.TAGG 
1141 AATGATATCTGATAGATCAGGTAGATGTATAAAATCTCATCATAACGGTAGTATTGTTAG 
1 20 1 ATACCGT AAAGAAGAATATGGATCTTTCGATGCTCTATGTATATATAACATGAATGAAGT 
1261 GGATGAAATTTGGACTGGTGATAAGAAACATATTATATGGTCTACTATTGATAAAAAAAC 
1 32 1 AGGAACGTCTTTTATACCTATAGATCCTGTACTTTACGAAAAGTTAAAAGCTATTTCTTC 
1 38 1 TAAAGAGCATAAAGAATACAAAGATTTGAGAGGGTTTTGTAATAGCAGAACGGAGTATAT 
1 44 1 TTGTTGTtCGGTATCTAAGTACTATTTCGACTTACCTACAAAAACAGATTTAATACACGA 
1 50 1 GGTGATTAATTCTATCGATTATGATACTAAGTCAGTGGGTACACCCGACTGGTATACTCT 



1 561 GCCTATACAAGTT/ AACAAACTATCCTAGGTAATATGTCTTACGAAGAGTTATTTAATAT 
1 62 1 AGTAAGAGGTAATATAGCTCTTGAAGAAGACAATGAATATGGCTGTGATTAACATTAATG 
1 681 GTAATACTTTTCTAAAAACTAATCTCAAGTATTGTTTACAAGCGACTGAAGTAATAGTTT 
1 741 TAGCAAAATAATACCTTTACTGTTAGTTCTACAATCGAAATTATGCTGTAACATGAGGTA 
1 801 AGGATATATTTATTAATACGTTACATCTT7CGAAAGACTTTGATCGTAGTATAATATTAT 
1861 ACATCTGCTCTACTTATTATACATAAGAAAATUGTATTTTA7TTAGTG.CGCTGATAAAJ 
1 921 CGTGTTTAAAGTATACAACGGACGTCTATTTCCAAAAAATCTGCGCGTGTTAACGGATTA 
1981 AAATCTACATGAAAATATCTCTTAAACTTTATTTCTACGTATAACAAACAACAGACTGAT 
2041 TTTATATATTACGAATAACTATTTTCTTAGGTTTTTTATATAGATGCTATACAGTGTTTT 
2101 TACGCGTATATACAAAATACGGAAAAATAATAXAACAGAAATGATTCTGGCAATATACGA 
2161 CCGCAATGCCTATATTGTTAAAAAAACAGGTATCGGAAGTATCTTGTTACGCGATAACGG 
2221 TACTAGGAATACTATGCTTAATATTATTTACGATACTAGTAGTCGTAACATGCAAATGGT 
2281 ATTACGCGTTTCCGTACTTTAGCAAGGTATGTCCTGATGAGTGGATAGGATATAATAGTA 
2341 AATGCTACTACTTTACTATCAATGAAACTAATTGGAATGATAGCAAAAAACTATGCGATG 
2401 TTATGGATTCTTCATTGATAAGGTTCGATAACATAGAAACTCTAAATTTCGTGTCGCGAT 
2461 ACGGTAAGGGTAGTTACTGGATAGACATAAATCAAAATAGAAAAATTCCGGGTATTAATT 
2521 TCTGACTATATTATGAACAAGGCGTTAATGATATTTGTCTATTATTTGACACGAGTAACA 
2581 TTATCGAAATGTCTTGTATATTTCACGAAAGAACGATATGTGTTAAAGAAGATAGATACA 
2641 CCGATTGGTATAGCGAATAGATGCGT'T-AGATTTlAGT'AeCTe^TTT'TTATArAATAGTATT 
2701 TTGTACGTTCTTGTAAACAGAAAATCCGTATAGTTTATATTTTTAATCAAAGTAATAACG 
2761 AATATCTCGATGTCACGTATAAACGCAGATTCTAGATATTAAATTCTCAACGTACGTCAT 
2821 TTGCATTCCCTGAGATGATACTTTGCTATTTTATTATACCGTAGTCTATACAACCACTAC 
2881 AAAGTTAAACGAAGTAAAATTATTGATTCGTTGTTATTATTTCAGCACAGTAGTACTCGC 
2941 TATCTTCGTTTAAATCTAATAACACGCCCTTTGAAACATTTTTGTGCTAGATAATAATAC 
3001 GTTATTATTACACTAACCTGTATTTC.TTCTAATCTTTAAGGT.GIGCTAACGATATATCAC 
3061 GGGATTAAAAGGTTATTAGTAGTCGTATAACAACATAATAATAGCACATCTGTATATTTA 



3121 TATACCTCTCGAGTACATAAAAATAATATGTTTTGATAAAACGTAAATCAATAAGTGTA7 
3181 AAGGTATTATTTCTTTTAATGAAGAAATAGGACGTAATGTCTAAATCAGATTTATATTCC 
3241' CGAAAATATTTTTCTTAGATGTATATGTTAGTTAAATTACGTGATTATATTATAAGTTAT 
3301 CTGCTTACTTTAACATTATATAGTAATTATATACTAACCGATCTTAACACTTCCGTACAA 
3361 AGAGGTATGCCCGCATCTGCGAGATATTGTGATTTTCGTATTTAGATATGTGAATATAGT 
3-421" PAfGTACT'AAGGGGAOT'HGGTGSAM^T'AGAA 

3481 ACTACCACGTTCCTCTTTTAAGAGTTAACTATTTACTCGGAGGTATCGGTATACATACAA 
3541 TTCTATATAATTTAGTTAATCGCTTTTTACGCGCATAAGTCTACGTATAATGTCTTTGTT 
3601 TAAGTAACTATCCCTGGAATATTCCTAAAAATAGCGGAATTTTTGTTTGTACGTCGGCTA 
3661 C l AG'oAACA i GhmAuu i mhj i icoli i i iav.'jmT«Go«/m i i iui i mi itwiuumu 
3721 TGCATAATTCGGTAACACTAGCTGCTTCAGTTCCGTATTCATCTACTTTTATCACAGATT 
37B1 TTTGCCTGATATTACCTATCCTCAAAGTTTTTGTATCGGATATACCTACTAATTCACCTG 
3341 ACTTGAATAGATCATTACATCCCATATGGATTAGCGCGTCTTTCAAGTCTACGTCATCTT 
3901 CTAATTCGAATTTAGGTAAATAAAGAACTATTTCTTTCAAAGTCATATCTTTTTTAGATA 
3961 TTATTTTATTGATATTCTTACCGTTATTGAGAGAATCAACTACTCCTAAAATAGAAAAAG 
4021 TATTAAAATTACGTAAACATATTAGTTTTAACATCTTTTTATTTGTTTAGTATATAAACT 
4081 TATATCGTAAAGAAATATAGTTCTCTTAATTTACGTTTATTAGGAAATAAAATAGACATA 
4141 TAGATATACACCTTAGATACTTAATTAAAATGGATAGAAACATTAATTTACCCGAAGAAG 
4201 AGCTTAAATATATAAAAGAATGTTGCGAAGTTCtTTATTTACCCCAGCCGACGAGAAtGG 
4261 ATATAATCGGTGTTATGAATGATAGCGATATTTCTTGGAATGAAAATCTCATCATTCTAA 
4321 TGTCGGAAGATGGTAAGATTTATGTGTACGACGATGAAGCTCTATACAAAGTAGCGGATA 
4381 CTATGGAAGAGTTCTCTGAAATAGGACTTATTAATCTAGGAAATGAAGTTTATCATTGTA 
4441 GAGAGGATATAAAACCTCTTCXGAAGAGGATAGGGATAAGGATGAGTATATAATGAAGA 
4501 TAAGGGAAAAAGCCAGGCAGCTTATAGATAATTCACAAAAAGATTTTGAGGCCATTCTAG 
4561 ATTCTTTGGAGAATAAACATGTATCAATT'TAGGTATATAATATAAGGT'AGCAAAATACGT 
4621 ATGTCCGTGTACGCTTATGTATTTTTTTATTTGGATTAAAATCGATACGCTAGAGAATAG 



4681 CGGAGTAGCTTCTGTATCCGCCGCGGTTATTTACTTTAGTAATCTATTAAACTACTTTTA 

4741 TCTCTATTATTAAGTTAGTCATACCCACGAATATATATTCATAAAAACATCTTCCTCTCA 

*KLKTLEEKLCKN 
4801 GATTTTCATCCGTAAAATTATTACTTTAATTTTGTTAACTCTTCTTTTAAACATTTATTT 

EEKLALLQIRIETIY RSYIN 
4861 TCCTCTTTGAGAGCTAAAAGTTGTATTCTAATTTCGGTTATATACCTACTATAAATATTA 

D I I K Y Q? H K L Q D* V- I- C L E K I E 
4921 TCTATAATTTTATATTGGTTCTTAAGTTGATCTACTATCATACATAATTCTTTTATCTCT 

QRYKEDLEKLISNCKIDIES 
4981 TGGCGATATTTTTCATCGAGTTCTTTTAGAATACTATTACACTTTATATCAATTTCTGAT 

KI EKINSDYEENITKI FDSM 
5041 TTTATTTCTTTTATGTTACTATCATATTCTTCATTTATTGTTTTTATGAAATCACTCATT 

(ORF5) 

VSGLAKNNRRIIM 

5101 ACACTTCCAAGAGCTTTATTATTCCTACGTATTATCATTTTAAAAATCTAAATTATTGTT 

5161 TATTATATTTACATATGTTATAAACATTATTTTTAAGTATTGCCAATTAATAAATATAGT 

5221 TCATCACGATCATCTGAAGTATCTTATCATCCCGCGGCATAATTTTATATTTTAGTATAT 

5281 TTGGTTTATTACGTGCGTAGATTTAGAATCTTTATTCACACCCGATTATTGTGTTGATAG 

5341 TATATAATATTAAAACAATGGAGTTTTAAGCTCTACCAGAAGATATCATTAAGTATAGCG 

5401 TTCTATATGATCTAAAACATGTATATTGTACCTAGTGATAATAGCATTTTTACCATTTTC 

5461 GTTTATATTGCTAGCTCATCTATACGTAACTTTATGGTTTATTAGCTATCTCATGTAACT 

5521 ACATATTGTTATCATCGTTTAACAGTATTATTTCTTTTAACTGATCCATTAAACTTTTTT 

5581' TATGTATfAGCTCATA'Tft^ YATAAACTTTA 

5641 CTACCTTCAAAGAAAATAGAGGAGAAATCCAATGTGAAA7ATGTAATATAAGGTCGCGGT 

5701 GGACGTACAATTCACTTGTTTCGCTGTCCGATACCACATTTAATACTATTCCCCTATAAT 

57*1 CGTAGTAGTCATTGCATGATCTATTTATCCTGTCTAATTCATTTATTAATTCTACGGAGG 

5821 ACTCCTTACTCATCCAATCTAATATATCTCTTCCTCTAGAACTACATAACCTTGTAGCAT 

5881 TTATGTATTCATTTTCTTTCATCATAATAATTTCTATATCTTCGTAACTTAGCTTACAAA 

5941 AGHATTATTGATGGANCTACTTTGATTTeGATATTGAATAATTGTT^ 

6001 ACAAATACTTAATTTTCATAATTTGTTAATAACCTAAATATTTGTATTTCTCTATAAAAA 
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6061 CCACATACAAAAACTATTTACATTATTATTCCAGACAATAGATTATGGTATTTTTGGGAT 

6121 CGGTACAAGCAAGTGTTATAAAGCAAGTAAATCTGGCCTCGAATTCAACATAATCACCTT 

6181 CCACAACA7AACCGCTTTCTTCTTCCGAAGATTCGGACAATCGCTATGATAAAAGTATTT 

6241 ACTAGTCGTTGAAATAAAGATGTAGAATTGCCCATTATATTATAATTTAGTCACTTATTT 

6301 GTTTATTTTTTTAGTACACGCICTATCTTTCTTTACATCATAAGGCAATATTTATCATAT 

6361 ATCACGATAATCAGGATATTTATATATGTTTAATAACGGCTTTTACGTTTTATTGATTAA 

6421 GACGACACGGTAACAAAATTAATATACTTATATTGTACTACATAGTTAGCAAAATATCTA 

6481 TTAGAATACTTGTTTTGCCTATGTTTACTTCTATATTGCTATATAAGACTTATCACCTTC 

6541 AATATTTCTGTTTGTACCATATTCATGACTAGATTTTTCTATATCAAAATATATATTTAG 

6601 77A7AAAAA7AA7777A777CA7AGA7G7GA7G7CAAGC7C777A77GCC7A7A7A77CA 

6661 AG7A7G77G7A7777A771CA7AGA7GCGA7G7CAAGC7C777A77GCC7A7A7A77CAA 

6721 G7A7G77G7A7777A777CG7GGGG7AACCAA77CCA7777G777CA7CACGAG7AA777 

6781 777CA7C7A7AAC7CGCA7CGC7GA77CAA7AGC77CCGC7C777GCGA7GCCG7G7C7G 

6841 CCAA77C7777AA7AGA7A777G7AGAA7A7GGCA77A7CA7ACAGACC7AA7A77777C 

6901 7AGAA7G7C77GCCAA7A7G77C7CA7CAAGA77777GGA7GG7777AAACACAGG7CCA 

6961 GAA7G77G7AGG77C7GA7GC777CGC7G777A77C7CC77AA77CAA7777ACA77777 

7021 CAAA7ACA7C7777AAACGAC7777GC7GT7AA7GAC7G7CA7G777C7GGAAAA7CC77 

7081 7A7CCGA7GA7A77G7A777G7A7A77G7C77AA7GCTA7G7CCGCTA.7CAG.CA7A7CCA 

7141 CGGA77CAGA77C7GGA777GTA7CCATAT7ACAGA7CA7C7CTAAAGTTG7G7G77C77 

7201 CA77CA7CACGG7AAACACAATG77ACTATCAGCGCCTC7CT7GAGAAACA7GCT7ACCA 

7261 TA7C7A7T77G77G77T7GTATAGCGTAGCACATCGCTG7CACACAGGGCCT7T7GCTGA 

732 1 AATAG7CAA7G7TTGCTCCGGAA7CTAGCAACATCCTGCATACTTCTGTG7C7CC7T7GC 

7381 7CA7GGCGA7GA7AAGGGGAG7ACA7CCGTAGCAATCTTCTA7G77GG7ACAGGC7C7G7 

744 1 GA7C7AATAGCAA77C7A7ACCT77AATATC7TTTGACATAACAGCTAAA7GGAGAGGCG 

7 50 1 TAAAACGA7CAG7G7TGGGCACA7CAG7GTCGGCTCCTCTAGCTA7AAGGAGCCTCA7CA 

7561 7G7CAAGA77777AC7AAT7GTGGCCAAATG7AAGGGAG7G777CCT77C77G7AGA7AA 



I 
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7621 CATCATTTATGAACTTTCCAGAATCTAATAATTCTTCCACTTTAACAACGTCTCCTTCTT 
7681 CCACGGCCTCATGCAATTCAGATTCTATATCCGGATAGTTATAATCGGGATAAGTGTTGT 
7741 AACTCATCAGTAATTTAATCATTTCAACATCTCTAAGTCTGACGGCCATCTTTATAGGCG 
7801 AGTATCCGTTGATAGTAAAATTCGGATTGATGTAAGAATCCAACAGGCGTCTAGCCACAT 
7861 CCAGTTCTCCAAAGAGAATAGCATTGCAAAGTTCTACACGATCCATTGTATAATATAGGT 

7921 GTTCAACACCTCTCGATATATCATTATTTGTTTTTTCAATTTTATTATAAGTAGTTTGAA 

(CRTS') 

M E E G K 

7981 TGCATTTTTAAGTTTAATAAATCTTGATAAAGTATATTTAAAAAATGGAGGAGGGTAAAC 

PRRSSAVLWMLIPCGSIIIV 
8041 CGCGACGTAGTAGCGCAGTATTATGGATGTTGATTCCATGCGGAAGTATTATTATCGTGC 

LSVFVIILSTRPPVPPDIKI 

8101 TA7CTGTATTTGTGATTATTTTATCCACAAGACCTCCTGTACCTCCAGATATTAAAATAC 

LYCKEGWVGY N KNCYFFSEE 
8161 TTTACTGTAAAGAAGGATGGGTAGGATATAATAAAAACTGCTATTTTTTCTCTGAGGAAA 

KN NKSLAVERCKDMDGH LTS 
8221 AAAATAATAAATCATTAGCTGTAGAAAGATGTAAGGATATGGACGGGCATCTGACTTCAA 

ISSKEEF KFILRY K G PGN H W 

8281 7TTCTAGCAAAGAAGAATTTAAATTTATCCTAAGATACAAAGGTCCGGGAAA7CACTGGA 

IGI EKVDFNGT* 
8341 TTGGAATAGAAAAAGTTGATTTTAATGGAACTTAGAAATTAGAAGATGGGTCATCTTATG 

8401 ATAATATAGTTCCTATCAAAGGAATAGGTGATTGTGCATATTTAAGCGATAGATCTATAA 

8461 TGTGGTCATTTTGTTTTTTACCGAAGAAGTGGATATGCAGAATAATACTTTTATAGAAAT 

8521 GCTAGCTAATAATGTATAATATTTTTATGAAAAAATGGAAATTGATATGCATAATTATAA 

8581 CCAAAAGTATGATATTGCAAGATGTCTTGTATACTTTGATCATAGGTATACATGAGCAGT 

8641 TTAAAATATGCAAATACAGATATAACTATTAAGATGGTGATAATAACACCGAAAGTCTTG 

8701 GAAGATGATAGTTTATCAGAATCAAGTATCCATTTTGCGAATAACAGATTCCATTTTGAT 

8761 TTGTATTATATAAAGCCTTGGGCCTTCGTAAGTATATTATATTTATTTTTATGTTTTTTA 

•EYN NAVAPKAT 
8821 TATAATATTATTTAAAACCTTTACTATTCGTAATTATTCGCTACCGCJGGUTTGCCGTT 



v. 
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TE I YDRI KVI I ENFSOGS LM 
8881 GTTTCTATATAGTCTCTTATTTTTACTATTATTTCATTAAATGAATCACCACTAAGCATA 

IKSINI RDEAHQ'KRIGSI IS 
8941 ATTTTAGATATATTAATACGATCTTCTGCGTGTTGTTTTCTTATGCCGCTAATTATTGAC 

LYRRNNEEIEA EIAAIMED T 
9001 AAATAGCGACGATTATTTTCTTCAATTTCAGCTTCTATAGCAGCTATCATTTCATCTGTA 

RI DENNDIDQEMSYD K ELVR 
9061 CGGATATCTTCGTTATTATCTATATCCTGTTCCATAGAATAGTCTTTTTCTAATACTCTT 

EAKT. LLY'TSLKSYYSTEIEK 
9121 TCAGCTTTTGTTAGTAAATAAGTACTCAATTTACTGTAATAAGATGTTTCTATTTCTTtA 

FRNSIDK ISRDYVEDIKS IQ 
9181 AAGCGATTACTTATATCCTTTATAGATCTATCATAAACTTCGTCTATTTTGGAAATCTGA 

NQLEQSHDIDKE TEINLAVC 
9241 TTCTGTAGTTCTTGGCTATGGTCTATATCTTTTTCAGTTTCTATGTTTAGCGCTACGCA.T 

RRYYRML ISVR TAVYPS I HP 
9301 CGTCTATAGTATCTCATGAGAATAGATACTCTAGTCGCTACATATGGAGAAATCCATGGA 

IVYOL LLOSVYEGNIKI RTH 
9361 ATTACATAGTCCAGTAGTAATTGTGAAACGTATTCTCCGTTTATTTTTATTCTAGTATGC 

RIN1IPDHLKNNDKLTNLLR 
9421 CTAATATTTATTATCGGATCGTGTAACTTGTTGTTATCTTTTAATGTATTGAGCAATCTC 

RSSELQKWDNFDKGELNCLR 
9481 CTAGACGATTCTAATTGCTTCCAATCATTAAAATCTTTGCCTTCTAAGTTGCATAATCTA 

TINVFG NSRMIVVEVDLLKM 
9541 GTAATATTTACGAACCCGTTACTTCTCATAATTACTACTTCTACATCTAATAATTTCATA 

S M ; F Y, E N I- G C 1= K I D- T- I H> M* K K* I 
9601 GACATAAAATACTCATTAATGCCGCATATTTTTATATCTGTTATAUCATCTTCTTGATA 

(0RF10) 

TNRVEKFKH 
9661 GTATTTCTAACTTCCTTAAATTTCATTATTATTTAGTATACTCTACAAAGTAACTAAATA 

9721 AGTTTATTTTATTTATCGGTTTTATACAAATATAAAATTTTTCTATGGTGCATATATCAC 

9781 ATCCTACTATATTACTATATAAGAAATTACACATATTAATATTTGTACAATCTAGTTCGT 

9841 CTACTA7TTTTATCCAATAGTCCTTAGATGTATTTAATAAGCCACTATTCGTATTTATGT 

9901 TAATATTATTCCCACCGCCAAGATTATCACATACCATCATGCTATCATCCCAACTTAACT 



9961 TATTTTCGGAAATAAAATAACATAAATTATCGAATTCTAACCAGTCTTTACCACACCTTA 



$ 



- 11 - 

1 002 1 CTAAATATCTATCTCTGTCTATATCTACTAAAATAATAACAAATAACAATATAGTGAAAG 

1 008 1 CTATCGTTAATAGACCGCGTTTCCTAGCTTTTTTACACATTTTCTTATCATATTTATATT 

10141 ACTGTTTTTTACAATTTTTAATATTATTTGTCTCATTTTGTAGTAGTAGATTTCGTAAGA 

10201 TCATGTCATCTAATTTTGTCAGTATCATCCATCTAATTTCTATGGGTAAAGTATACCATT 

1 02 6 1 T TGTATT TACTAGGT TTGCATTCATTATATTGTTTATCTCTAATAACATT TCATATCTTT 

1 032 1 TTGTCAACATTTTTAATATATTTTGTATTATACGAAAACAGTTGGGAAATATTGTTTTGA 

10381 nATATTCATnTTTCTAAmrGffl^ 

1 044 1 CGTTAGATAATAAATCGAAAAGACTTAAATCTTGAAAAGAATTTCCGTCGTATATCTTGA 

10501 ACGTTTTCATACGTTCTATTTCTTCTTTTATAATATTTATGCAGGAACTTAAGTATTCAC 

10561 ACTTATTAATTATTTTCATATTCTTTTCCATTCCGTTAGAAATTCTAGCTTTGTAAGATA 

1 062 1 AGTAA7ACAATGATACTATATAGTTAGCAAGAATAATAGCATTATTGTTAATAGTATGTA 

10681 ACATAAAGGTGTATTCCCTCATCATCTAAAGCGTTATATCAGCACCGTGGTCTATTAATA 

1 074 1 CCAATATATTACTAAAATCATTATATCGTTCTAA1 ATTATTCGTGTAATATATTCTACCC 

1 080 1 ATTCTTCCTTTATATTTATATTAGCTCCTCTAGATATGATGTAATCTAATAGGTCGTCGG 

1 086 1 TAATAAACCTAGTTTCGTATAAGGGGGATGTATTAGTTAAAACGCTTTGTTTGTTAATAT 

1 092 1 CGGCGCCGTGGTCTAATAATACTTTTATTATTTTTAATCTAAACGGATCGTATACTTTCA 

1 098 1 TAGCGTAATGTATAGGGTATTTACCATTCGCGCCGTCTTCTGAATTAATGTCAGCGCCGT 

11041 ATTCTATAAGCAATTTTACTATTTTACTTTCTGTTCTATTAGCAGCTATATGTATAGGTT. 

11101 TCAAACAATAATGTTCTAAATTAACAATAGCTCCGTATTCTAATAGCGATCTAGCTATAT 

11161 CTACACAACCTTTTTTTATAGCCTTATGTAATGGTGGTGTAAAGAACCCAGAAATGTTAG 

11221 GATCC . 



r -32- 

TABLE 1 

Open reading frames In the fowlpox BamHl fragment In pMH23. 







9 


No. Of 










amino 


Size In 


ORF 


Start 


Stofi 


acids 


kl lodaltons 


1 


416 


1672 


418 


48.2 


2 


2166 


2669 


167 


19.8 


3 


4054 


3608* 


148 


15.4 


4 


4170 


4592 


140 


16.5 


5 


5138 


482'* 


105 


12.5 


6 


5974 


5519* 


151 


17.9 


7 


• 7906 


6674* 


410 


46.8 


8 


8025 • 


8374 


116 


13.2 


9 


8632 


8835 


67 


7.9 


10 


9686 


8844* 


280 


33.0 


11 


10120 


9689* 


143 


16.6 


12 


10705 


10139* 


188 


22.4 



* ORFs 3, 5, 6, 7, 10, 11 and 12 are transcribed on the 
complementary strand to that shown above, i.e. in the reverse 
direction to the others. 

Sequences upstream of the eleven largest major ORFs were 
0.5. done.d. into lac Z transla.t1o.nal fusion vectors, for the measurement 
of promoter activity In a transient assay system. 

(b) FP4b gene. 

Random clones of fowlpox virus DNA were sequenced. The 

sequence of each clone was translated on the computer into the 

10 six possible frames and compared to a library of published 
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vaccinia sequences. Several fowlpox genes with some degree of 
homology to vaccinia genes were detected. One gene Identified In 
this way was a fowlpox gene highly homologous to the vaccinia 4b 
gene (this Is referred to herein as the FP4b gene). The M13 

05 clone containing these sequences was used to probe an EcoRI 
library of fowlpox virus clones (see above) and a clone 
containing DNA of 2.7 kllobases was detected. The clone was 
sequenced as described for pMH23 and found to- contain the' 5»' end. 
of the FP4b gene, upstream putative promoter sequences, and the 

10 3' end of another open reading frame. 
8. Assay for strength of promoter . 

(a) Translational fusion vectors. 

Translational fusion vectors allow potential promoter 
sequences, up to and Including the initiation codon of the test 

15 gene, to be fused to a gene with an easily assayable product. 
Thus the promoter sequences under test are in exactly the same 
sequence context relative to the start of the ORF as in the 
original gene, and only the coding sequences of the gene are 
altered. The translational fusion vectors used in this case have 

20 the beta-galactosldase gene (UcZ) as an assayable marker and are 
called pNM480, pNM481 and pNM482. They are modifications of 
PMC1403, M.J. Casadaban et ah, J. Bacteriology 143, 971-980 
(1980)- made by Minton,. Gene 31, 269-273 (1-984)-. The^ modified . 
vectors have additional unique cloning sites available in all 
• three reading frames. 

25 

(b) Cloning fowlpox sequences Into translational fusion 
* vectors. 

Random M13 subclones generated for sequencing purposes were 
used to place test sequences upstream of the" lac Z gene. Ml 3 
30 clones which started just downstream of" an ATG codon and ran in 
an upstream direction (Into the putative promoter) were 



c 
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selected. ' Fragments were excised from the clones, using 
restriction enzymes sites In the Ml 3 polyl Inker, and cloned Into 
pNM vectors cut with suitable restriction enzymes. The 
appropriate clone was chosen so that relatively little of the FPV 

05 gene ORF was present In the fused protein, and the appropriate 
vector was chosen so that the few amino acids encoded by the FPV 
ORF were 1n frame with the lac Z gene. For that reason the 
vectors differed by one or two nucleotides and are designated pNM 
480. 481 and 4~g2. Plasml'ds containing fowlpox sequences wfrlch 

10 had generated a complete lacZ gene were Identified tentatively 
either by a blue colour on bacterial plates or by probing with 
radiolabeled fowlpox DNA, and definitively by sequencing across 
the fusion site and into the putative promoter. Figure 2 shows 
how all of these clones (except number 1) were cloned into the 

15 pNM vectors. (Because the only suitable' M13 clone for 0RF1 was 
In a different orientation, different restriction enzymes had to 
be used. The pttM 480 plasmid was cut by BamHI. and Hind lH. using 
a BamHI site between the EcoRI and Hind lH sites marked, and the 
HindlH site was end-repaired appropriately to accommodate the 

20 BamHI - Haelll promoter fragment excised from the M13 vector). 
Table 2 gives a list of the ORFs Involved, the name of the Ml 3 
clone used, the pNM vector used and the number of amino acids 
encoded by the fowlpox ORFs (i.e. from the starting methionine 
codon onwards) participating In the fused products. 
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TABLE 2' 



Translatlonal fusion contructs of promoters (plus part of the 
ORF) with the lacZ gene construct. 



Nucleotide 





Starting . 


Final 


No. amino 


length of 5'- 


ORF 


pNH vector 


construct 


acids of 


non-cod1ng 


ref. 


ref. 


vector ref. 


ORF 


seauence 


1 


pNM 480 


• 

pNMGF32 


20 




2 


pNM 481 


pNMGJl 3M 


7 




3 


pNM 481 


pNMGE23 


3 




4 


pNM 482 


. pNMGA5 


13 




5 


pNH 482 


pNMGK4 


10 


189 


6 


pNM 480 


pNHGF6 


9 




7 


pNM 482 


pNHGB86 


13 




7 


pNM 480 


pNMSAU4 


2 




8 


pNM 482 


pNMGC44 


14 


395 


10 


pNM 481 


pNHGF7 


3 


(not yet known! 


11 


pNH 480 


pNHGLS 


37 




12 


pNM 482 


pNMGF78 


103 




FP4b 


pNH 481 


pNH4b30 


34 


283 


FP4b 


pNM 481 


pNM4b31 


21 


292 



(c) Testing promoters In a transient assay system. 

Chicken embryo fibroblast cells (CEFs) seeded In 24-cell 
05 tissue culture dishes (Llnbro) were Infected with fowlpox virus 
strain HP441 when the cells were '80-90% confluent. At various 
times after Infection ONA was Introduced Into the cells by a 
calcium phosphate transfectlon procedure. The system was 
optimised with respect to multiplicity of infection, times for 
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DNA transfectlon and quantity of DNA.for transfection, using the 
plasmld pMH6 which contains the vaccinia 1 1 K promoter fused to 
the beta-galactosldase gene which was found , to express 
beta-galactosldase activity In this transient assay system In 

05 FPV-lnfected cells. Although there was variation between 
Individual experiments, the technique adopted, when Internally 
controlled with pMH6 as a positive, and plasmld containing 
Irrelevant, sequences as a negative.,, worked, consistently. 

Cells In 24-well plates were Infected at 1 pfu of FPV HP441 

10 per cell. Precipitates were prepared In 96-well plates by adding 
ingredients in the following order: pNM plasmid DNA (0.2pg-5pg) 
plus lpg FPV "helper" DNA, 100^1 HEPES buffered saline (pH 7.12), 
and finally 7pl of 2M CaCl2- The plates were tapped gently to 
mix the contents, then left at room temperature for 20-30 minutes 

15 until a just visible, fine precipitate developed. 24-well plates 
of cells to be transfected were pre-washed with HEPES-buffered 
saline at room temperature, then the appropriate precipitate 
added at 4 or 20 hours after infection of the cells. After 30 
minutes at room temperature the excess precipitate was removed 

20 and 0.5ml 199 medium containing 5VCS was added. The transfected 
cells were reincubated as normal for a further 48 hours. 
Beta-galactosldase activity was assayed as follows. 
The tissue culture medium was carefully removed by 
aspiration, and the cells resuspended In 50^1 of 0.25M TRIS-HC1 

2.5, pH 7.5. 5mM dl thlothreitol (DTT>. The resuspended cells, were, 
freeze-thawed three times then transferred to 96-well plates for 
assay of beta-galactosldase content. To each lysate was added 
Ipl of a buffer containing 60mM Na2HP04, 40mM NaH2P04, lOmM KC1 , 
ImM MgCl2. 50mM 2-mercaptoethano! and lOOyl of 2mg/ml ortho- 

30 nitrophenylgalactose (ONPG) In 60mM Na2HP04, 40mM NaH 2 P04. ONPG 
Is a colorimetrlc substrate for beta-galactosldase which changes 
from colourless to yellow. The assay was Incubated for up to 2 
hours at 37*C until colour developed, then lOOul of 2H Na2C03 was 
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added to stop the reaction. The intensity of the yellow colour 
was determined by measuring the absorbance at 405nm of each well 
in an ELISA plate reader. 
RESULTS OF PROMOTER ASSAYS. 

05 The sequences from in front of the eleven largest major ORFs 

from pMH23 and from in front of the FP4b gene (see above) have 
been cloned into translational fusion. vectors (vectors containing 
the lacZ gene) and their activity as promoters measured in a 
transient assay, system.. Table 2- above- g»Wes a 11 sfr of th-ese 

10 constructs. Of the 14 FPV promoter constructs tested, five were 
found consistently to have promoter activity. These were the two 
FP4b constructs, the 0RF8 C13.2K gene) promoter, the 0RF5 (12. 5K 
gene) promoter and the ORF10 (33. OK gene) promoter. All these 
are promoters of the invention. The remainder of the constructs 

15 had lower levels of activity. Table 3 shows the results of three 
experiments. An asterisk denotes a construct containing a 
promoter of the invention. 



. TABLE 3 

Measurement of promoter . strength 1n assay for beta-galactosidas 
using a colorlmetrlc substrate (* - according to the Invention) 



Expe 



^rlment A. 










OPTICAL DENSITIES at 


405nm 


* 

• 




Final 






• 


ORF 


Construct 


Amount of DNA added 


20 hours p 


ref. 


Vector ref. 


P.2vg. 


1 .Oug 


5.0vq 


1 


pNMGF32 


0.011 


0.057 


0.04 


2 


not done 








3 


pNMGE23 


0.013 


0.066 


0.024 


4 


not done 








5 


pNMGK4 


0.026 


0.098 


0.103 


6 


pNMGF6 


0.047 


0.093 


0.057 


7 


pNHGB86 


0.031 


0.079 


0.027 


7 


pNMSAU4 


0.024 


0.065 


0.016 


8 


pNMGC44 


0.033 


0.129 


0.248 


10 


pNMGF7 


0.027 


0.071 


0.138 


11 


pNMGL8 


0.03 


0.052 


0.062 


12 


PNMGF78 


0.04 


0.063 


O.C69 


FP4b 


pNH4b30 


0.065 


0.197 


0.310 


FP4b 


pNM4b30 


0.057 


0.203 


0.260 
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Experiment B (DNA added earlier than In A) 

OPTICAL DENSITIES at 405nm 



Final 





ORF 


Construct 


Amount of DNA added 4 


hours | 




ref. 


Vector ref. 


0.2uq 


l.Oyq 


5.0 H | 




T 


pNMG'F32 


0.00 


0.01 


0.05 




2 


pNMGJ13K 


0.03 


0.00 


0.02 




3 


PNMGE23 


0.02 


0.03 


0.06 




4 


pNMGA5 


0.03 


0.39 


0.08 




5 


pNMGM 


0.18 


0.59 


0.89 




g 


DNMGF6 


0.01 


0.01 


0 02 




7 


pNHGS86 


0.00 


0.00 


0.04 




7 


pNMSAU4 


0.03 . 


0.04 


0.03 


* 


8 


pNHGC44 


0.11 


0.22 


0.71 


* 


10 


pNHGF7 


0.08 


0.10 


0.16 




11 


pNKGL8 


0.06 


0.05 


0.04 




12 


pN'HGF 78 


0.05 


0.05 


0.07 


* 


FP4b 


pNM4b30 


0.35 


0.27 


0.58 


* 


FP4b 


pNM4b31 


0.28 


0.32 


0.44 




Whole pMH23 


0.01 


0.02 


0.02 




No DNA 


0.01 


0.01 


0.01 



Experiment C (duplicate of B) 





OPTICAL 


DENSITIES 


at 405nm 






Final 








ORF 


Construct 


Amount of DNA added 4 hours | 


ref. 


Vector ref. 


O.Zpg 


1 .Oug 


5.0p< 


1 


PNMGF32 


0.00 


0.07 


0.04 


2 


pNMGJ13M 


0.02 


0.07 


0.08 


3 


pNMGE*3 


0.06 


0.01 


0.00 


4 


PNHGA5 


0.05 


0.00 


0.09 


5 


pNHGK4 


0.07 


0.13 


0.74 


6 


pNMGFS 


0.05 


0.05 


0.02 


7 


pNHGB86 


0.05 


0.07 


0.08 


7 


pNMSAU4 


0.04 


0.05 


0.05 


8 


pNMGC44 


0.05 


0.18 


0.65 


10 


pNMGF7 


0.02 


0.05 


0.31 


11 


PNMGL8 


0.03 


0.06 


0.09 


12 


phrnjr /o 


U.UJ 


A A-J 

y • vj 


A AO 
V • VL 


FP4b 


pNH4b30 


0.28 


0.30 


1.24 


FP4b 


pNM4b31 


0.11 


0.25 


1.18 



Whole pMH23 
Np DNA 



0.10 

0.0.3- 



0.06 
<U>A 



0.1C 
0.04 



■ X 

» 
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For experiment' A the ONA was added 20 hours post Infection, and 
for experiment B and C, which are essentially duplicates of each 
other, the DNA was added 4 hours post Infection. It Is 
Interesting to notice that some of the promoters appear to have 

05 higher activity when added early after Infection. For example at 
4 hours post Infection the 0RF5 promoter can give higher levels 
of activity than the 0RF8 promoter* whereas when It 1s added late 
It has lower levels. It may be that 0RF5 Is an early promoter 
which does not function well when, added relatively late In 

10 Infection. The ORF10 promoter, on the other hand, seems to 
function better when added later 1n Infection. Both the FP4b 
constructs give consistently high levels. 

Part of the sequences of the constructs used to test the 
FP4b, the 0RF8 M3.20, 0RF5 (12. S'<) and 0RF10 (33K) promoters 

15 are shown below. Each sequence starts and finishes with DNA from 
the pNM vector involved, and shows how the Intervening sequence 
Is made up from fowl pox sequences plus Ml 3 DNA. Two of the 
putative promoter sequences have been tested out in two separate 
constructs, each having different numbers of ORF amino acid 

20 coding sequence 1n the fused product. These are the 
FP4b30/FP4b31 pair and the pNHGB86/pNMSAU4 pair. In both cases 
the levels of promoter activity between the two different members 
of the pair were very similar, indicating that the length of 
fowlpox gene In the fused product is not critical. 



Part of the sequence of pNM4b30. 



pNM481 sequence ><- Ml 3 seque 

GCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCTATTACGCCAGAATTCGAG 
10 20 30 40 50 60 

>< — start of fowl pox (FP> sequence 

CTCGCCCTATTAACATTGCCTAGTAGTACTCCACTTTGGATAAGAAATCTGCATGATAAA 

70 80 90 100 110. 120 



TATATTGATATCCTACCACCTATTAAAGTACCATTATCTAATAGCAATAAGATAGATAAA 
130 140 150 160 .170 180 

CAAATGTTTTTTGATGAAGTTATTACGTGGATAAATATATATCTTCAGGAAAAGGGTATT • 
190 200 210 220 230 240 

ATGTTACCAGATGATATAAGAGAACTCAGAGATGCTATTATTCCTTAACTAGTTACGTCT 

250 260 270 280 290 300 

|— start of FP4b 
CTTTAGGTACTtATTTTGATACGTTACAAGTAAAAAACTATCAAATATAAATGGAATCTG 

HetGluSerAsp 

310 320 330 • 340 350 360 

ATTCTAATATAGCGATTGAAGAAGTTAAATATCCTAAfATTTTATTAGAACCTGTTTACT 
SerAsnlleAlalleGluGluValLysTyrProAsnlleLeuLeuGluProValTyrTyr 
370 380 390 400 410 -420 

end of FP sequence — >< — sequence from M13mpl0 — 
ATAATAACCTAGAAGTAATAGGATCTCATTTACGGGGATCCTCTAGAGTCGACCTGCAGC 
AsnAsnLeuGluVallleGlySerHlsLeuArgGlySerSerArgValAspLeuGlnPro 
430 440 450 460 470 480 

>< sequence from pNM481 (UcZ gene) — ' etc... 

CCAAGCT TGC TCCCC TGGCCOTCGT T T T ACAACGTCGTGACTGGG AAAACCCTGGCGT T 
L'ysteuAVaP roLeuA'l aVal Va VLe uG 1'nAr g Arg As pTYpGTu As nProGTyYat 
490 500 510 520 530 



Part of the sequence of pNM4b31 . 



pNM481 sequence ><- Ml 3 sequence 

GCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCTATTACGCCAGAATTCGAG 

10 20 30 40 50 60 • 

>< — start of fowl pox (FP) sequence 

CTCGCCCAGTCACAAGTATTAACATTGCCTAGTAGTACTCCACTTTGGATAAGAAATCTG 
70 80 90 100 110 120 



CATGATAAATATATTGATATCCTACCACCTATTAAAGTACCATTATCTAATAGCAATAAG 
130 140 150 160- 170 180 

ATAGATAAACAAATGTTTTTTGATGAAGTTATTACGTGGATAAATATATATCTTCAGGAA 
190 200 210 220 230 240 

AAGGGTAT7ATGTTACCAGATGATATAAGAGAACTCAGAGATGCTATTATTCCTTAACTA 
250 260 270 280 290 300 

I— start of FP4b 
gene 

GTTACGTCTCTTTAGGTACTTATTTTGATACGTTACAAGTAAAAAACTATCAAATATAAA 

Met 

310 320 330 340 350 360 

end of FP sequence — 
TGGAATCTGATTCTAATATAGCGATTGAAGAAGTTAAATATCCTAATATTTTATTAGAAC 
GluSerAspSerAsnlleAlalleGluGluValLysTyrProAsnlleLeuLeuGluPro 
370 380 390 400 410 420 

><— sequence from M13mpl0 >< — sequence from pNM481 

CTGGGGGATCC1CTAGAGTCGACCTGCAGCCCAAGCTTGCTCCCCTGGCCGTCGTTTTAC 
GlyGlySerSe. ArgValAspLeuGlnProlysLeuAlaProLeuAlaValValleuGIn 
430 440 450 460 470 480 

(lacZ) etc... 

AACGTCGTGACTGGGAAAACCCTGGCGTT 
Ar g Ar g As pTr pG 1 uAs nP roG 1 yVa 1 
490 500 



Part of the sequence of pNMGC44. 



< pNM482 sequence ><- Ml 3 se 

GCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCTATTACGCCAGAATTCGAG 
10 20 30 40 50 60 

>< — start of fowl pox (FP) sequence 

CTCGCCCTGAACTTTCCAGAATCTAATAATTCTTCCACTTTAACAACGTCTCCTTCTTCC 
70 80 90 100 110 120 



ACGGCCTCATCCAATTCAGATTCTATATCCGGATAGTTATAATCGGGATAAGTGTTGTAA 
130 140 150 160 170 180 

CTCATCAGTAATTTAATCATTTCAACATCTCTAAGTCTGACGGCCATCTTTATAGGCGAG 
190 200 210 220 230 240 

TATCCGTTGATAGTAAAATTCGGATTGATGTAAGAATCCAACAGGCGTCTAGCCACATCC 
250 260 270 280 290 300 

AGTTCTCCAAAGAGAATAGCATTGCAAAGTTCTACACGATCCATTGTATAATATAGGTGT 
310 320 330 340 350 360 

TCAACACCTCTCGATATATCATTATTTGTTTTTTCAATTTTATTATAAGTAGTTTGAATG 
370 380 390 400 410 420 

j— start of ORF 8 gene 
CATTTTTAAGTTTAATAAATCTTGATAAAGTATATTTAAAAAATGGAGGAGGGTAAACCG 

MetGluGluGlyLysPro 
430 440 450 460 470 480 

>< — sequence from Ml 3mpl 0 >< — 

CGACGTAGTAGCGCAGTATTATGGGGGGATCCTCTAGAGTCGACCTGCAGCCCAAGCTTC 
ArgArgSerSerAlaValLeuTrpGlyAspProLeuGluSerThrCysSerProSerPhe 
490 500 510 520. 530 540 

sequence from pNH482 (lacZ gene) etc... 

GATCCCCTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTT 
AspProLeuAlaValValLeuGlnArgArgAspTrpGluAsnProGlyVal 
550 560 570 580 590 
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Part of the sequence of pNMGK4. 

ICIKICEICSKICCIICCSSECSKICCKSIf 



1 , pNM482 sequence ><- HI 3 sequence — 

GCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCC7CTTCGCTATTACGCCAGAATTCGAG 
10 20 30 40 50 60 

>< — start of fowl pox (FP> sequences 

CTCGCCCGAATAAAGATTCTAAATCTACGCACGTAATAAACCAAATATACTAAAATATAA 
70 80 90 100 110 120 



AATTATGCCGCGGGATGATAAGATACTTCAGATGATCGTGATGAACTATATTTATTAATT 
130 140 150 160 170 180 

GGCAATACTTAAAAATAATGTTTATAACATATGTAAATATAATAAACAATAATTTAGATT 
190 200 210 220 230 240 

1— start of 0RF.5 gene >< — " sequence from M13mp10 — 

TTTAAAATGATAATACGTAGGAATAATAAAGCTCTTGGGGATCCTCTAGAGTCGACCTGC 
MetllelleArgArgAsnAsnlysAlaLeuGlyAspProLeuGluSerThrCys 
250 260 270 280 290 300 

><— - sequence from pNM482 (lacZ gene) 

AGCCCAAGCTTCGATCCCCTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGC 
SerProSerPheAspProLeuAlaValValLeuGlnArgArgAspTrpGluAspProGly 
310 320 330 340 350 360 

— etc . . . 

GTT 

Val 



* 



Part of the sequence of pNMGF7, 



pNM481 sequence — ><- M13 seqi 

GCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCTATTACGCCAGAATTCGAG 
10 20 30 40 50 60 

><— start of fowlpox (FP) sequence (exact left end unknown) 

CTCGCCCXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXCTGACAAAATTAGATGACATGAT 
70 80 90 100 110 120 

— FP' sequence continued 

CTTACGAAATCTACTACTACAAAATGAGACAAATAATATTAAAAATTGTAAAAAACAGTA 
130 140 150 160 170 180 

ATATAAATATGATAAGAAAATGTGTAAAAAAGCTAGGAAACGCGGTCTATTAACGATAGC 
190 200 210 220 230 240 

TTTCACTATATTGTTATTTGTTATTAT7TTAGTAGATATAGACAGAGATAGATATTTAGT 
250 260 270 280 290 300 

AAGGTGTGGTAAAGACTGGTTAGAATTCGATAATTTATGTTATTTTATTTCCGAAAATAA 
310 320 330 340 350 360 

GTTAAGTTGGGATGATAGCATGATGGTATGTGATAATCTTGGCGGTGGGAATAATATTAA 
370 380 390 400 410 420 

CATAAATACGAATAGTGGCTTATTAAATACATCTAAGGACTATTGGATAAAAATAGTAGA 
430 440 450 460 470 480 

CGAACTAGATTGTACAAATATTAATATGTGTAATTTCTTATATAGTAATATAGTAGGATG 
490 500 510 520 530 540 

TGATATATGCACCATAGAAAAATTTTATATTTGTATAAAACCGATAAATAAAATAAACTT 
550 560 570 580 590 600 

|i-0RF.10><- seqoen'ce* from Mnmpl'O- 
ATTTAGTTACTTTGTAGAGTATACTAAATAATAATGAAATTTAGGGGATCCTCTAGAGTC 

MetLysPheArgGlySerSerArgVal 
610 620 630 640 650 660 

— >< sequence from pNM481 (lacZ gene) — 

GACCTGCAGCCCAAGCTTGCTCCCCTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAAC 
AspLeuGlnProLysLeuAlaProLeuAlaValVaUeuGlnArgArgAspTrpGluAsn 
670 680 690 700 710 720 

etc ... 

CCTGGCGTT 
ProGlyVal 



INSERTION OF GENES INTO FOHLPOX VIRUS 

Foreign genes are Introduced Into the virus by a process of 
homologous recombination. This process has been described In the 
literature In detail for vaccinia virus and an analogous 
05 procedure can be used for fowlpox virus. 

1 . Infection with virus and transfectlon of DNA. 

25- cm 2 bottles of CEF cells at about 80% confluence are 
Infected with about 10 7 pfu (about 3 pfu/cell) of an attenuated 
strain of fowl pox virus 1n lmT of serum-free medium. The bottles 

10 are Incubated at 37'C for 2 hours with occasional gentle 
agitation then 5ml of 199 medium (Gibco) with 5% calf serum are 
added and the cells are Incubated for a further 2 hours at 37'C. 

30 minutes before this 2 hours is up the DNA/C3PO4 
precipitates are prepared. 20pg of plasmid DNA, which contains a 

15 "type 1 construct" and therefore includes non-essential regions 
of FPV plus 2ug of fowl pox "helper" DNA are added to 1ml of HEPES 
buffered saline (HBS) pH 7.12 1n a plastic bijou (HBS Is 0.818% 
NaCl (w/v), 0.594% HEPES (w/v), 0.021 Na2HP04 anhydrous (w/v), 
adjusted to pH 7.12. with 1H NaOH). 66yl of 2M CaCl2 1s added 

20 slowly down the side of the bijou. This is left at room 
temperature for 20 to 30 minutes for a fine precipitate to form. 

After the 2 hours Incubation, the cells are washed twice with 
HBS at room temperature and the precipitate is gently added to 
the cells. This Is left at room temperature for 40 minutes and 

26 then 5m1 of 199 medium with 5% calf serum is added and the cells 
are Incubated at 37 'C for 3 to 4 hours. The medium Is then 
changed for fresh medium. 

2. Detection of recombinants . 

After the virus has been allowed to grow In the cells for 3-5 
30 days (this Is when a complete cytopathic effect can be seen) the 
cells plus supernatant are harvested and freeze thawed three 
times. The progeny virus Is then plaqued on CEF cells at about 
500-1000 plaques per 10cm petrl dish and an overlay of medium 
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containing IX low gelling temperature agarose- is added. The 
plaques are then lifted onto nitrocellulose and probed with DNA 
from' the foreign gene which is being inserted Into the fowlpox 
virus by the method of L. Villareal et an., Science 196. 183-185 
05 (1977). Plaques which are found to light up with the probe are 
picked from the agarose overlay (which has been stored at 4*C>, 
freeze-thawed three times and replaqued. The plaques are then 
probed again with the foreign DNA tp confirm thajt, the recombinant 
virus has been isolated successfully from the agarose overlay. 

10 INFECTION OF CHICKENS HITH THE RECOMBINANT VIRUS 

Twenty two chicks, 5 days old, are placed In a container 
(46 x 46 x 58cm. 3 ) A spray gun Is used to create a fine aerosol 
using 80 ml. water containing 1.5 x 10** p.f.u." of virus grown in 
chicken . embryo fibroblast cells. This vaccination is repeated 

15 when the "chicks are 26-days old. 

EXAMPLE 2 

Promoters are signals 1n the viral DNA which direct 

n unm i^biwn vi i\m/i • w hi wm j |/i wow b w i ^ n i i i w*ct ci wt w witvvh 

transcription of greater amounts of RNA than weak promoters. 

20 This Is used as a way of Identifying efficient promoters. If 
radiolabel led viral RNA is hybridised to restriction fragments of 
viral DNA, Immobilised on a nitrocellulose filter, particular 
regions of the virus containing Strong, promoters, might be 
Identified. For late RNA this might be expected' to be difficult 

25 since late RNA transcripts are known to run well past the end of 
their genes, possibly Into adjacent restriction fragments, hence 
confusing any attempts at mapping. . However for early RNA It 
should be a useful approach. ('Early' RNA is RNA made before DNA 
replication and 'late' RNA is made after DNA replication, by 

30 definition. RNA made even earlier, i.e. before protein 
synthesis, can be referred to as 'Immediate early RNA'). A 
convenient method of making radiolabel led RNA of the immediate 
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early class is to use a jn vitro system containing purified 
virus, deoxynudeoslde triphosphates, one of which is 
radloactively labelled, and a suitable buffer. This has been 
described for vaccinia virus by S. Venkatesan & B. Moss. J. 
05 Virology 37 738-747 (1981) and It is found that the RNA produced 
\n vUro (I.e. In a test tube) 1n this manner has the same 
pattern as that made U» vWo (I.e. In tissue culture). 

METHODS 

Virus purification . 

10 Virus was grown In chick embryo fibroblast (CEF) cells and 
purified as follows: Forty 75cm 2 flasks of CEFs were infected 
with 5 x 10 6 pfu/flask of PP9 (a plaque-purified isolate of 
HP440). The flasks were Incubated at 37*C for 5 days. The cells 
were then shaken off into the medium and then spun down at 7,000 

l'-i rpm for 15 minutes. The supernatant containing the virus was 
then centrifuged at 15,000 rpm for 3C minutes at 4 4 C. The virus 
pellets were pooled and resuspended in 40ml phosphate-buffered 
saline (PBS). This was layered onto a cushion of 10ml of 35% 
(w/v) sucrose and centrifuged at 15,000 rpm for 30 minutes. The 

20 viral pellet was then resuspended In 1ml of PBS. This was then 
layered onto a 20-50% (w/v) sucrose gradient and centrifuged at 
15,000 rpm for 30 minutes. The two viral bands were collected, 
pooled, layered onto two 20-60% metrlzamlde gradients (about 1ml 
per gradient) and 7 centrifuged at 30,000 rpm for 18-20 hours. The* 

25 viral band was then collected (1ml per gradient). 

In vitro synthesis of labelled RNA. (based on the method of S. 
Venkatesan & B. Moss. 1981 Joe. cU.) 

10 9 pfu of purified virus particles from the above procedure were 
used as follows to produce labelled RNA. The virus solution was 
20 made to 0.05% Nonldet P-40 (NP-4C) and left on Ice for 1 hour. 
This was then added to a solution containing 50mM Tris-HCl (pH 
. 8.5). 10mM dlthiothreitol, 5mM ATP. ImM each of GTP and CTP. 10mM 
MgCl2. 100uM S-adenosylmethlontne (AdoMet), and lOOpCl of 
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32p_iabe1 led UTP. the total volume being 5ml. After 30 minutes 
at 37'C fresh AdoMet (the same amount again) was added and the 
reaction Incubated for a further 30 minutes. The reaction was 
terminated by addition of EDTA to lOmM, and the tubes were placed 

05 on Ice. The virus was then pelleted by centrlfugatlon at 30,000 
rpm for 30 minutes, the labelled RNA being contained In the 
supernatant. -To the supernatant was added sodium dodecyl 
sulphate (SOS) to a final concentration of 0.25X and the mixture 
extracted with an equal volume of phenol saturated In TE (lOmM 

10 TRIS-HC1, pH 7.5, ImM EDTA). The aqueous 'layer was removed and 

" extracted with diethyl ether and the RNA precipitated by addition 
of 1/10 volume of 3M sodium acetate and 2.5 volumes of ethanol . 
The RNA was spun down at 15,000 rpm for 10 minutes and the pellet 
resuspended In 4ml of guanldine thlocyanate solution <6M 

15 guanidlne thlocyanate. 0.5% sodium N-1aurylsarcosine. 5mM sodium 
citrate, 0.1M 2-mercaptoethanol). This was layered onto a 1ml 
cushion of CsCl/EDTA (5.7M GsCl , 0.1H EDTA) and centrifuged a.t 
38.000 rpm for 18-20 hours at 18'C to pellet the RNA. The 
supernatant was carefully removed and discarded and the RNA 

20 pellet resuspended In SQOyl of diethyl pyrocarbonate-treated 
water. 

Hybridisation to DNA. 
a) Restriction digests 

An EcoRI digest. o.f FPV CiNA,. and a BamHI/EcoRI digest of the 

25 11.2kb BamHI clone were separated on 0.91 agarose gels. The DNA 
was transferred to nitrocellulose filters by Southern blotting. 
Single-stranded preparations of M13 clones from the 11.2kb 
fragment were spotted onto nitrocellulose and baked for 2 hours 
at 80'C in a vacuum (1/10 of the DNA from a 1ml culture). The 

30 filters were prehybridlsed In 10ml of 5 x SSC (SSC Is 0.15M NaCl. 
0.015M Sodium-citrate) for 2 hours at 60*C. The suspension of 
labelled RNA being used as a probe was boiled for 3 minutes 
before addition to .the filters. We. probe and filters were 
Incubated, with shaking, at 60'C for 18-20 hours. The filters 
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were washed In 2 x SSC, 0.1% SDS at 42'C for 30 minutes, then in 
0.1 x SSC, 0.1% SDS at 25C for 30 minutes, and thereafter exposed 
to X-ray film. 

RESULTS . 

05 The. labelled viral RNA was found to hybridise strongly to 

only two EcoRI fragments In the digest of FPV DNA. One was 790bp 
long and the other was 3830bp. (Some larger sized bands, 
particularly in the region of about 6,000bp, hybridised weakly). 
The . RNA also hybridised to a 3830bp band In the EcoRI/BamHI 

10 digest of the 11.2kb BamHI fragment. Labelled EcoRI FPV DNA 
fragments of sizes 790bp and 3830bp, purified from an agarose 
gel, were used to probe,, by the well-known method of Grunstein & 
Hogness, an Eco RI library of FPV DNA fragments cloned into 
pUC13. Several pUCl 3 clones were thus Identified which were also 

15 probed with the labelled in vitro RNA. The resulting group of 
pUCl 3 clones proved to fall into two categories, those with viral 
inserts of 790bp in size and those with inserts of 3830bp in 
size. The 3830bp-sized clones were probed with labelled 3830bp 
fragment from the 11.2kb BamHI fragment (nucleotides 6162 to 9992 

20 : the EcoRI sites are underlined) and were found to be the same. 
The 3830bp fragment includes the whole of the strongly promoted 
0RF8 and 0RF10 genes. Also, approximately 120bp of sequence from 

■ 

each end of the 790bp clone have been determined (see below). 
Using, this 790bp clone and the sequence info/mat ion given bel.ow, 

25 the 5 "-end of this gene and thence the promoter region can 
readily be identified. The following is the partial sequence 
determined from near both ends of the 790bp fragment. (A few 
nucleotides from each end have not oeen sequenced). The 
numbering above- 680 is approximate as the exact length of the 

30 fragment Is not known. N- a nucleotide not yet determined. 
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TGTCATCATA TCCACCTATA AATGTAATAT AATTAGCGCC TGATTGTGTC GATACATTAT 
10 20 30 40 50 60 

CGGGTGAAAA GTCCACCGTA ATATTGCTTT TATCGGTTGT ATTTACCACG TATAC 

70 80 90 100 no 

sequence not yet determined — GTTCT 

680 

TTTTCATTTT TAATGTACGT TATTTTGTAA TAATGTTTAT ATAAATTACC ATACTTTANN 
690 700 710 720 730 740 

NA7TATAAAT ATTGA*A'G ; TA"A" A'A'GA'ATAGTt T'A'AA'T'TACCT A^CA'TAGA'AC" ATtA* 
750 760 770 780 790 

b> Ml 3 clones from the 11.2M) fragment. 

A series of single-stranded Ml 3 clones from the 11.2kb BamHI 
fragment were- spotted onto nitrocellulose. Clones were chosen so 
that each major open reading frame (ORF) in the fragment was 
represented by one clone in the same orientation as the expected 
RNA from that ORF (i.e. unable to hybridise to the RNA) and one 
clone 1n the opposite orientation (i.e. expected to hybridise to 
RNA from that ORF). The clones were as follows. 
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ORF Clone Nucleotide No. Expected 

reference Start Finish to hybridise? 

<♦ - YES; - - NO) 



1. 


(416-1674) 


GC47 


407 


725* 








GC50 


860 


545* 


- 


2. 


(2166-2671) 


GB53 


2682 


2887* 


♦ 






GF18 


2639 


2581* 


- 


3. 


(4055-3606) 


G045 


3706 


3918* 








GA28 


3887' 


3627* 


♦ 


4. 


(4170-4594) 


GF48 


4096 


4305* 








GF95 


4481 


4228* 


- 


5. 


(5138-4821) 


GF73 


5078 


5404* 








GG2 


5041 


4727* 


♦ 


6. 


(5974-5519) 


GE3 


• 5604 


5821* 








GF110 


5824 


5601* 


♦ 


7. 


(7906-6674) 


GC59 


7000 


7290* 








GC61 


7283 


7005* 




8. 


(8025-8376) 


GF74 


7977 


8238 


♦ 






GB150 


8351 


8085* 


- 


q 


(8632-8837) 


MFP344 


R7R1 
o / O 1 


ftQftfi* 
OJO\J 








GJ24 


8785 


8584 




10. 


(9686-8844) 


GC43 


9277 


9499* 








GB84 


S495 


9230* 




11. 


(10120-9689) 


GC45 


9813 


10066* 








GB161 


10107 


9828 




12. 


(10705-10139) 


GB64 


10359 


10571* 








GF21 


10584 


10276* 


♦ 



* This Is not the actual end of the clone, but merely the point 
up to which it was sequenced. 
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RESULTS 

Only the following clones hybridised to the in vitro RNA: 
GG2 very strongly (ORF 5 promoter) 
GC61 weakly 

05 GJ24 very strongly (despite the fact that It Is a "same 
orientation" clone) 
GB84 moderately strongly (ORF 10 promoter) 

These results give a reasonable confirmation of the use of 
the RNA transcription method of Identifying an Immediately early 

10 strong promoter. Thus, the clones containing the ORF 5 and ORF 
10 promoters hybridised strongly to the mRNA. No signal was 
obtained from the clone containing the ORF 8 promoter, presumably 
because It does not act at the Immediate early stage. The strong 
hybridisation of GJ24 (nucleotides 8785 to 8584) 1s probably a 

15 result of the mRNA transcribed for the ORF 10 gene (nucleotides 
9686 to 8844) running beyond the end of the gene at 8844, well 
into the DNA which encodes ORF 9 (8632 to 8835). 

It follows that when an Immediate early promoter is required, 
the ORF 5, ORF 10 and "790 bp" promoters appear likely to be the 

20 only good choices. 
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CLAIMS 

1. Fowlpox virus <FPV) promoter ONA, for promoting the 
transcription of a foreign gene Inserted In a FPV vector, said 
DNA comprising the promoter of any one of the following FPV DNA 
genes and consisting substantially of sequence to the' 5'-end of 
05 said gene which Is non-coding for said gene and up to 150 
nucleotides long: 

(1) The FP4b gene which encodes a protein of about 657 amino 
adds 1n a sequence beginning 

Met Glu Ser Asp Ser Asn He Ala He Glu 
10 Glu Val Lys Tyr Pro Asn He Leu Leu Glu 

or a variation of such sequence; 

(2) The BamHI fragment 0RF8 gene encoding a protein of about 
116 amino acids In a sequence beginning 

Met Glu Glu Gly Lys Pro Arg Arg Ser Ser 
15 Ala Val Leu Trp Met Leu He Pro Cys Gly 

or a variation of such sequence; 

(3) The BamHI fragment 0RF5 gene encoding a protein of about 
105 amino acids in a sequence beginning 

Met lie He Arg Arg Asn Asn Lys Ala Leu 
20 Gly Ser Val Met Ser Asp Phe He Lys Thr 

or a variation cf such sequence; 

(4) The BamHI fragment 0RF10 gene encoding a protein of 
about 280 amino acids in a sequence beginning 

Met Lys Phe <-ys Glu Val Arg Asn Thr He 
2S Lys Lys Met Asrf He Thr Asp lie Lys He 

or a variation of such sequence; and 

(5) The gene of which the coding stand hybridises strongly 
to FPV RNA and Is at least partly located within an 
approximately 790 bp DNA sequence, containing near Its 5'-end 

30 the sequence: 

(5') TGTCATCATA TCCACCTATA AATGTAATAT and near Its 
3' -end the sequence: 

AAGAATAGTC TAAATTACGT AACATAGAAC ATCAT <3'>. 
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2. FPV promoter DNA according to Claim 1 wherein the non-coding 
sequence is of length up to 100 nucleotides Immediately preceding 
the start codon of the gene. 

3. FPV promoter DNA according to Claim 2 wherein the non-coding 
05 sequence Is of length up to 80 nucleotides immediately preceding 

the start codon of the gene. 

4. FPV promoter DNA according to Claim 2, within any one of the 
following sequence of 100- nu&leotldes 1mtned4a«t«ely preceding the 
start codon of the gene, as follows :- 

10 FP4b (5') TATTACGTGG ATAAATATAT ATCTTCAGGA AAAGGGTATT ATGTTACCAG 
ATGATATAAG AGAACTCAGA GATGCTATTA TTCCTTAACT AGTTACGTCT 
CTTTAGGTAC TTATTTTGAT ACGTTACAAG TAAAAAACTA TCAAATATAA 

<3'> 

0RF8 <5'> AGAATAGCAT TGCAAAGTTC TACACGATCC ATTGTATAAT ATAGGTGTTC 
15 AACACCTCTC GATATATCAT TATTTGTTTT TTCAATTTTA TTATAAGTAG 

TTTGAATGCA TTTTTAAGTT TAATAAATCT TGATAAAGTA TATTTAAAAA 

(3'> 

0RF5 (5') TAAACCAAAT ATACTAAAAT ATAAAATTAT GCCGCGGGAT GATAAGATAC 
TTCAGATGAT CGTGATGAAC TATATTTATT AATTGGCAAT ACTTAAAAAT 
20 AATGTTTATA ACATATGTAA ATATAATAAA CAATAATTTA GATTTTTAAA 

t3'> 

0RF10 (5')ACTAGATTGT ACAAATATTA ATATGTGTAA TTTCTTATAT AGTAATATAG 
TAGGATGTGA TATATGCACC ATAGAAAAAT TTTATATTTG TATAAAACCG 
ATAAATAAAA TAAACTTATT TAGTTACTTT GTAGAGTATA CTAAATAATA 

2.5. cy* 

or within a variation of such sequence. 

5. A recombination vector comprising a cloning vector 
containing, as an insert, a non-essential region (NER) sequence 
of FPV, said NER being interrupted by DNA comprising (a) promoter 

30 DNA according to Claim 1, 2, 3 or 4 followed by <b> a foreign 
gene transcribable by the promoter. 

6. A recombination vector comprising a cloning vector 
containing, as an insert, in order: 

(1) a first homologously recomblnable sequence of the 
35 fowl pox virus (FPV) genome. 
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(2) a sequence within a first portion of a non-essential 
region (NER) of the FPV genome, 

(3) TPV promoter DNA according to Claim 1, 2, 3 or-4. 

(4) a foreign gene transcrlbably downstream of the promoter 
05 (whereby when the fowlpox virus RNA polymerase binds to the 

promoter It will transcribe the foreign gene Into mRNA), 
<5> a sequence within a second portion of the same NER of 
the FPV genome, the first and second sequences being In the 
same* rsl*t'1ve> orientation as are; the f4r*t and second 

10 portions of the NER within the FPV genome, and 

(6) a second homologously recomblnable sequence of the FPV 

1 genome, said sequences (1) and (6) flanking the NER In the 

FPV genome and being In the same relative orientation 1n the 

15 acomb1nat*on vector as they are within the. FPV genome. 

7. A DNA cassette which comprises a FPV promoter according to 
Claim 1, 2, 3 or 4, transcrlbably linked to a foreign gene. 

8. A recombinant cloning vector containing a DNA cassette 
according to Claim 7. 

20 9. A recombinant fowlpox virus (FPV) which Is the product of 

homologous recombination of a parent FPV with the Insert DNA of a 
.recombination vector according to Claim 5 or 6. 

10. An In vitro culture of animal cells Infected with a virus 

claimed in Claim 9. 
25 11. A culture according to Claim 10 wherein the animal cells are 

chicken cells. 

12*. A method of vacci'natVng' a responsive animal-, which comprises* 
Inoculating It with a recombinant FPV as defined In Claim 9. 
13. A method according to Claim 12 wherein the animal is a 
chicken. 
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